Hi There! Welcome to Data 101.
NOTE:
I'll be using Google Colab and Python in this article.
To learn how to read files, refer to the previous article in this series:
https://chrisdevcode.hashnode.dev/reading-data-from-files-using-python
Data Exploration
In data exploration, we find out the basic shape of a data set: the number of records (rows), the number of columns, the column names, and the data type of each column.
I will be using the pandas library throughout, along with two CSV files: airlines.csv and airports.csv.
To import the data:
import pandas as pd
airlines = pd.read_csv('airlines.csv')
airports = pd.read_csv('airports.csv')
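If you don't have the two CSV files at hand, here is a minimal sketch of the same import step. It reads from an in-memory string instead of a file. The column names and rows are made up for illustration and are not the real contents of airports.csv.

```python
import io
import pandas as pd

# Hypothetical CSV content standing in for airports.csv (assumed columns).
csv_text = """iata,city,country
NBO,Nairobi,Kenya
JFK,New York,USA
"""

# read_csv accepts a file path or any file-like object, so StringIO works here.
airports = pd.read_csv(io.StringIO(csv_text))

print(airports)
```

With a real file, you would pass the file name to pd.read_csv() exactly as in the lines above.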
Viewing First and Last Few Rows
To view the first 5 rows of the data set, use .head() from pandas:
airports.head()
You can use .head(x) to get the first x rows instead of the default 5.
To view the last 5 rows of the data set, use .tail() from pandas:
airports.tail()
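Here is a small sketch of .head() and .tail() with and without an argument. The data frame is a hypothetical stand-in for airports.csv, and its columns are assumptions made for this example.

```python
import pandas as pd

# Hypothetical stand-in for the airports data (assumed columns and rows).
airports = pd.DataFrame({
    "iata": ["NBO", "JFK", "LHR", "DXB", "SYD", "GRU", "HND"],
    "city": ["Nairobi", "New York", "London", "Dubai",
             "Sydney", "Sao Paulo", "Tokyo"],
})

default_head = airports.head()   # no argument: first 5 rows
first_three = airports.head(3)   # first 3 rows
last_two = airports.tail(2)      # last 2 rows

print(first_three)
print(last_two)
```

Both methods return a new data frame, so you can store the result in a variable or chain further calls on it.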
Viewing Rows and Columns
To find the number of rows and columns in your data frame, use .shape. It returns the dimensions of your data set as a (rows, columns) tuple:
airports.shape
To view the columns in the data frame, use .columns. It is an attribute, not a method, so it takes no parentheses:
airports.columns
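As a quick illustration, the sketch below unpacks .shape into row and column counts and turns .columns into a plain Python list. The data frame and its column names are hypothetical.

```python
import pandas as pd

# Hypothetical stand-in for the airports data (assumed columns and rows).
airports = pd.DataFrame({
    "iata": ["NBO", "JFK", "LHR", "DXB"],
    "city": ["Nairobi", "New York", "London", "Dubai"],
    "country": ["Kenya", "USA", "UK", "UAE"],
})

# .shape is a (rows, columns) tuple, so it can be unpacked directly.
n_rows, n_cols = airports.shape

# .columns is an Index object; list() converts it to a list of names.
column_names = list(airports.columns)

print(n_rows, n_cols)
print(column_names)
```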
Viewing Data Types
To view the data type of each column, use .dtypes:
airports.dtypes
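The sketch below shows what .dtypes reports for a mix of text, integer, and decimal columns. The data frame is hypothetical, and the exact name pandas gives the text column's type can differ between pandas versions.

```python
import pandas as pd

# Hypothetical stand-in for the airports data (assumed columns, illustrative values).
airports = pd.DataFrame({
    "iata": ["NBO", "JFK", "LHR"],
    "elevation_ft": [5330, 13, 83],
    "latitude": [-1.32, 40.64, 51.47],
})

# One entry per column: the column name and its data type.
print(airports.dtypes)

# Individual dtypes can be checked with the helpers in pd.api.types.
elevation_is_int = pd.api.types.is_integer_dtype(airports["elevation_ft"])
latitude_is_float = pd.api.types.is_float_dtype(airports["latitude"])
print(elevation_is_int, latitude_is_float)
```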
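Other tricks you could use (where x is your data frame) are:
x.index               # the row labels
x.info()              # column names, non-null counts, data types, and memory usage
x.describe()          # summary statistics for the numeric columns
x['column'].unique()  # distinct values in one column; .unique() works on a single column, not on the whole data frame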
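To see .info(), .describe(), and .unique() in action, here is a minimal sketch on a hypothetical data frame. The column names and values are assumptions for illustration only.

```python
import pandas as pd

# Hypothetical stand-in for the airports data (assumed columns, illustrative values).
airports = pd.DataFrame({
    "iata": ["NBO", "JFK", "LAX", "LHR"],
    "country": ["Kenya", "USA", "USA", "UK"],
    "elevation_ft": [5330, 13, 128, 83],
})

# info() prints a summary (columns, non-null counts, dtypes) and returns None.
airports.info()

# describe() returns count, mean, std, min, quartiles, and max
# for the numeric columns.
summary = airports.describe()
print(summary)

# unique() is called on a single column and returns its distinct values
# in order of first appearance.
countries = airports["country"].unique()
print(countries)
```

Note that .unique() is a method of a single column (a Series), which is why the column is selected first.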