Please note, this is a STATIC archive of website www.w3resource.com from 19 Jul 2022, cach3.com does not collect or store any user information, there is no "phishing" involved.
w3resource

Pandas Practice Set-1: Exercises, Practice, Solution

[An editor is available at the bottom of the page to write and execute the scripts.]

Pandas Practice Set-1 [65 exercises with solution]

Diamonds:

This classic dataset contains the prices and other attributes of almost 54,000 diamonds. It's a great dataset for beginners learning to work with data analysis and visualization.

Pandas Dataset Exercises

Content

Column Name Description
price price in US dollars (\$326--\$18,823)
carat weight of the diamond (0.2--5.01)
cut quality of the cut (Fair, Good, Very Good, Premium, Ideal)
color diamond colour, from J (worst) to D (best)
clarity a measurement of how clear the diamond is (I1 (worst), SI2, SI1, VS2, VS1, VVS2, VVS1, IF (best))
x length in mm (0--10.74)
y width in mm (0--58.9)
z depth in mm (0--31.8)
depth total depth percentage = z / mean(x, y) = 2 * z / (x + y) (43--79)
table width of top of diamond relative to widest point (43--95)

Access dimond.csv

import pandas as pd
diamonds = pd.read_csv('https://raw.githubusercontent.com/mwaskom/seaborn-data/master/diamonds.csv')
print(diamonds)

Source: https://www.kaggle.com/shivam2503/diamonds

1. Write a Pandas program to read a csv file from a specified source and print the first 5 rows. Go to the editor
Click me to see the sample solution

2. Write a Pandas program to read a dataset from diamonds DataFrame and modify the default columns values and print the first 6 rows. Go to the editor
Click me to see the sample solution

3. Write a Pandas program to select a series from diamonds DataFrame. Print the content of the series. Go to the editor
Click me to see the sample solution

4. Write a Pandas program to create a new 'Quality -color' Series (use bracket notation to define the Series name) of the diamonds DataFrame. Go to the editor
Click me to see the sample solution

5. Write a Pandas program to find the number of rows and columns and data type of each column of diamonds Dataframe. Go to the editor
Click me to see the sample solution

6. Write a Pandas program to summarize only 'object' columns of the diamonds Dataframe. Go to the editor
Click me to see the sample solution

7. Write a Pandas program to rename two of the columns of the diamonds Dataframe. Go to the editor
Click me to see the sample solution

8. Write a Pandas program to rename all the columns of the diamonds Dataframe. Go to the editor
Click me to see the sample solution

9. Write a Pandas program to remove the second column of the diamonds Dataframe. Go to the editor
Click me to see the sample solution

10. Write a Pandas program to remove multiple columns at once of the diamonds Dataframe. Go to the editor
Click me to see the sample solution

11. Write a Pandas program to remove multiple rows at once (axis=0 refers to rows) from diamonds dataframe. Go to the editor
Click me to see the sample solution

12. Write a Pandas program to sort the 'cut' Series in ascending order (returns a Series) of diamonds Dataframe. Go to the editor
Click me to see the sample solution

13. Write a Pandas program to sort the 'price' Series in descending order (returns a Series) of diamonds Dataframe. Go to the editor
Click me to see the sample solution

14. Write a Pandas program to sort the entire diamonds DataFrame by the 'carat' Series in ascending and descending order. Go to the editor
Click me to see the sample solution

15. Write a Pandas program to filter the DataFrame rows to only show carat weight at least 0.3. Go to the editor
Click me to see the sample solution

16. Write a Pandas program to convert a python list to pandas series. Go to the editor
Click me to see the sample solution

17. Write a Pandas program to find the details of the diamonds where length>5, width>5 and depth>5. Go to the editor
Click me to see the sample solution

18. Write a Pandas program to find the diamonds that are either Premium or Ideal. Go to the editor
Click me to see the sample solution

19. Write a Pandas program to find the diamonds that are with a Fair or Good or Premium. Go to the editor
Click me to see the sample solution

20. Write a Pandas program to display all column labels of diamonds DataFrame. Go to the editor
Click me to see the sample solution

21. Write a Pandas program to read only a subset of 3 rows from diamonds DataFrame. Go to the editor
Click me to see the sample solution

22. Write a Pandas program to iterate through diamonds DataFrame. Go to the editor
Click me to see the sample solution

23. Write a Pandas program to drop all non-numeric columns from diamonds DataFrame. Go to the editor
Click me to see the sample solution

24. Write a Pandas program to include only numeric columns in the diamonds DataFrame. Go to the editor
Click me to see the sample solution

25. Write a Pandas program to pass a list of data types to only describe certain types of diamonds DataFrame. Go to the editor
Click me to see the sample solution

26. Write a Pandas program to calculate the mean of each numeric column of diamonds DataFrame. Go to the editor
Click me to see the sample solution

27. Write a Pandas program to calculate the mean of each row of diamonds DataFrame. Go to the editor
Click me to see the sample solution

28. Write a Pandas program to calculate the mean of price for each cut of diamonds DataFrame. Go to the editor
Click me to see the sample solution

29. Write a Pandas program to calculate count, minimum, maximum price for each cut of diamonds DataFrame. Go to the editor
Click me to see the sample solution

30. Write a Pandas program to create a side-by-side bar plot of the diamonds DataFrame. Go to the editor
Click me to see the sample solution

31. Write a Pandas program to count how many times each value in cut series of diamonds DataFrame occurs. Go to the editor
Click me to see the sample solution

32. Write a Pandas program to display percentages of each value of cut series occurs in diamonds DataFrame. Go to the editor
Click me to see the sample solution

33. Write a Pandas program to display the unique values in cut series of diamonds DataFrame. Go to the editor
Click me to see the sample solution

34. Write a Pandas program to count the number of unique values in cut series of diamonds DataFrame. Go to the editor
Click me to see the sample solution

35. Write a Pandas program to compute a cross-tabulation of two Series in diamonds DataFrame. Go to the editor
Click me to see the sample solution

36. Write a Pandas program to calculate various summary statistics of cut series of diamonds DataFrame. Go to the editor
Click me to see the sample solution

37. Write a Pandas program to create a histogram of the 'carat' Series (distribution of a numerical variable) of diamonds DataFrame. Go to the editor
Click me to see the sample solution

38. Write a Pandas program to create a bar plot of the 'value_counts' for the 'cut' series of diamonds DataFrame. Go to the editor
Click me to see the sample solution

39. Write a Pandas program to create a DataFrame of booleans (True if missing, False if not missing) from diamonds DataFrame. Go to the editor
Click me to see the sample solution

40. Write a Pandas program to count the number of missing values in each Series of diamonds DataFrame. Go to the editor
Click me to see the sample solution

41. Write a Pandas program to check the number of rows and columns and drop those row if 'any' values are missing in a row of diamonds DataFrame. Go to the editor
Click me to see the sample solution

42. Write a Pandas program to drop a row if any or all values in a row are missing of diamonds DataFrame on two specific columns. Go to the editor
Click me to see the sample solution

43. Write a Pandas program to set an existing column as the index of diamonds DataFrame. Go to the editor
Click me to see the sample solution

44. Write a Pandas program to set an existing column as the index of diamonds DataFrame and restore the index name, and move the index back to a column. Go to the editor
Click me to see the sample solution

45. Write a Pandas program to access a specified Series index and the Series values of diamonds DataFrame. Go to the editor
Click me to see the sample solution

46. Write a Pandas program to sort a Series by its values and index of diamonds DataFrame. Go to the editor
Click me to see the sample solution

47. Write a Pandas program to calculate the multiply of length, width and depth for each cut of diamonds DataFrame. Go to the editor
Click me to see the sample solution

48. Write a Pandas program to concatenate the diamonds DataFrame with the 'color' Series. Go to the editor
Click me to see the sample solution

49. Write a Pandas program to read specified rows and all columns of diamonds DataFrame. Go to the editor
Click me to see the sample solution

50. Write a Pandas program to read rows 0, 5, 7 and all columns of diamonds DataFrame. Go to the editor
Click me to see the sample solution

51. Write a Pandas program to read rows 2 through 5 and all columns of diamonds DataFrame. Go to the editor
Click me to see the sample solution

52. Write a Pandas program to read rows 0 through 2 (inclusive), columns 'color' and 'price' of diamonds DataFrame. Go to the editor
Click me to see the sample solution

53. Write a Pandas program to read rows 0 through 2 (inclusive), columns 'color' through 'price' (inclusive) of diamonds DataFrame. Go to the editor
Click me to see the sample solution

54. Write a Pandas program to read rows in which the 'cut' is 'Premium', column 'color' of diamonds DataFrame. Go to the editor
Click me to see the sample solution

55. Write a Pandas program to read rows in positions 0 and 1, columns in positions 0 and 3 of diamonds DataFrame. Go to the editor
Click me to see the sample solution

56. Write a Pandas program to read rows in positions 0 through 4, columns in positions 1 through 4 of diamonds DataFrame. Go to the editor
Click me to see the sample solution

57. Write a Pandas program to read rows in positions 0 through 4 (exclusive) and all columns of diamonds DataFrame. Go to the editor
Click me to see the sample solution

58. Write a Pandas program to read rows 2 through 5 (inclusive), columns in positions 0 through 2 (exclusive) of diamonds DataFrame. Go to the editor
Click me to see the sample solution

59. Write a Pandas program to print a concise summary of diamonds DataFrame. Go to the editor
Click me to see the sample solution

60. Write a Pandas program to get the true memory usage by diamonds DataFrame. Go to the editor
Click me to see the sample solution

61. Write a Pandas program to calculate the memory usage for each Series (in bytes) of diamonds DataFrame. Go to the editor
Click me to see the sample solution

62. Write a Pandas program to get randomly sample rows from diamonds DataFrame. Go to the editor
Click me to see the sample solution

63. Write a Pandas program to get sample 75% of the diamonds DataFrame's rows without replacement and store the remaining 25% of the rows in another DataFrame. Go to the editor
Click me to see the sample solution

64. Write a Pandas program to read the diamonds DataFrame and detect duplicate color. Go to the editor
Note: duplicated () function returns boolean Series denoting duplicate rows, optionally only considering certain columns.
Click me to see the sample solution

65. Write a Pandas program to count the duplicate rows of diamonds DataFrame.
Click me to see the sample solution

Python Code Editor:


More to Come !

Do not submit any solution of the above exercises at here, if you want to contribute go to the appropriate exercise page.

[ Want to contribute to Python Pandas exercises? Send your code (attached with a .zip file) to us at w3resource[at]yahoo[dot]com. Please avoid copyrighted materials.]

Test your Python skills with w3resource's quiz



Python: Tips of the Day

Find current directory and file's directory:

To get the full path to the directory a Python file is contained in, write this in that file:

import os 
dir_path = os.path.dirname(os.path.realpath(__file__))

(Note that the incantation above won't work if you've already used os.chdir() to change your current working directory, since the value of the __file__ constant is relative to the current working directory and is not changed by an os.chdir() call.)

To get the current working directory use

import os
cwd = os.getcwd()

Documentation references for the modules, constants and functions used above:

  • The os and os.path modules.
  • The __file__ constant
  • os.path.realpath(path) (returns "the canonical path of the specified filename, eliminating any symbolic links encountered in the path")
  • os.path.dirname(path) (returns "the directory name of pathname path")
  • os.getcwd() (returns "a string representing the current working directory")
  • os.chdir(path) ("change the current working directory to path")

Ref: https://bit.ly/3fy0R6m