Please note, this is a STATIC archive of website www.w3resource.com from 19 Jul 2022, cach3.com does not collect or store any user information, there is no "phishing" involved.
w3resource

Pandas: Extract data from MultiIndex levels DataFrame

Pandas Indexing: Exercise-14 with Solution

Write a Pandas program to extract a single row, rows and a specific value from a MultiIndex levels DataFrame.

Sample Solution:

Python Code :

import pandas as pd 
import numpy as np
sales_arrays = [['sale1', 'sale1', 'sale2', 'sale2', 'sale3', 'sale3', 'sale4', 'sale4'],
          ['city1', 'city2', 'city1', 'city2', 'city1', 'city2', 'city1', 'city2']]
sales_tuples = list(zip(*sales_arrays))
sales_index = pd.MultiIndex.from_tuples(sales_tuples, names=['sale', 'city'])
print(sales_tuples)
print("\nConstruct a Dataframe using the said MultiIndex levels: ")
df = pd.DataFrame(np.random.randn(8, 5), index=sales_index)
print(df)

print("\nExtract a single row from the said dataframe:")
print(df.loc[('sale2', 'city2')])
print("\nExtract a single row from the said dataframe:")
print(df.loc[('sale2', 'city2')])

print("\nExtract number of rows from the said dataframe:")
print(df.loc['sale1'])
print("\nExtract number of rows from the said dataframe:")
print(df.loc['sale3'])

print("\nExtract a single value from the said dataframe:")
print(df.loc[('sale1', 'city2'), 1])
print("\nExtract a single value from the said dataframe:")
print(df.loc[('sale4', 'city1'), 4])

Sample Output:

[('sale1', 'city1'), ('sale1', 'city2'), ('sale2', 'city1'), ('sale2', 'city2'), ('sale3', 'city1'), ('sale3', 'city2'), ('sale4', 'city1'), ('sale4', 'city2')]

Construct a Dataframe using the said MultiIndex levels: 
                    0         1         2         3         4
sale  city                                                   
sale1 city1  1.138551  0.507722 -0.870609 -0.186479 -1.038967
      city2 -0.002357  0.227624 -0.146152 -0.185473 -0.741184
sale2 city1 -1.307382  0.846347 -1.011645 -1.354593  2.208438
      city2  0.895843  0.350624  0.674705 -0.920561  0.610004
sale3 city1  0.571192  0.417562 -1.580535 -0.170085  1.258469
      city2  0.455347 -0.285652 -0.632070 -1.259128  0.710763
sale4 city1  0.178355  1.561962  1.627784 -0.097158  1.340233
      city2 -1.211935  0.256773  0.584134  1.505608 -1.559970

Extract a single row from the said dataframe:
0    0.895843
1    0.350624
2    0.674705
3   -0.920561
4    0.610004
Name: (sale2, city2), dtype: float64

Extract a single row from the said dataframe:
0    0.895843
1    0.350624
2    0.674705
3   -0.920561
4    0.610004
Name: (sale2, city2), dtype: float64

Extract number of rows from the said dataframe:
              0         1         2         3         4
city                                                   
city1  1.138551  0.507722 -0.870609 -0.186479 -1.038967
city2 -0.002357  0.227624 -0.146152 -0.185473 -0.741184

Extract number of rows from the said dataframe:
              0         1         2         3         4
city                                                   
city1  0.571192  0.417562 -1.580535 -0.170085  1.258469
city2  0.455347 -0.285652 -0.632070 -1.259128  0.710763

Extract a single value from the said dataframe:
0.22762367059081048

Extract a single value from the said dataframe:
1.340233465712309

Python Code Editor:


Have another way to solve this solution? Contribute your code (and comments) through Disqus.

Previous: Write a Pandas program to construct a DataFrame using the MultiIndex levels as the column and index.

Next: Write a Pandas program to rename names of columns and specific labels of the Main Index of the MultiIndex dataframe.

What is the difficulty level of this exercise?

Test your Programming skills with w3resource's quiz.



Python: Tips of the Day

Find current directory and file's directory:

To get the full path to the directory a Python file is contained in, write this in that file:

import os 
dir_path = os.path.dirname(os.path.realpath(__file__))

(Note that the incantation above won't work if you've already used os.chdir() to change your current working directory, since the value of the __file__ constant is relative to the current working directory and is not changed by an os.chdir() call.)

To get the current working directory use

import os
cwd = os.getcwd()

Documentation references for the modules, constants and functions used above:

  • The os and os.path modules.
  • The __file__ constant
  • os.path.realpath(path) (returns "the canonical path of the specified filename, eliminating any symbolic links encountered in the path")
  • os.path.dirname(path) (returns "the directory name of pathname path")
  • os.getcwd() (returns "a string representing the current working directory")
  • os.chdir(path) ("change the current working directory to path")

Ref: https://bit.ly/3fy0R6m