Class Notes: Data Handling Using Pandas - I

IP (065)

Pandas is a super useful Python library for working with data—like a supercharged Excel for coders. It helps you:

Pandas has two main data containers:

What? A 1D labeled array (e.g., [10, 15, 18] with index labels [0, 1, 2]).
Features:
- Data can be changed (mutable), but size can’t (immutable).
- Index labels make data easy to access.

How to Create a Series:

python

Copy

import pandas as pd  
data = [10, 15, 18, 22]  
s = pd.Series(data, index=['a', 'b', 'c', 'd'])  
print(s)

Output:

Copy

Cool Tricks with Series:

What? A 2D table (rows + columns).
Features:
- Columns can hold different data types (numbers, text, etc.).
- Size and data can be changed (mutable).

How to Create a DataFrame:

python

Copy

data = {'Name': ['Alice', 'Bob'], 'Age': [25, 30]}  
df = pd.DataFrame(data)  
print(df)

Output:

Copy

   Name  Age  
0  Alice  25  
1   Bob  30

Working with DataFrames:

Add a column: df['Salary'] = [5000, 6000]
Delete a column: del df['Age'] or df.drop('Age', axis=1)
Select data:
- Single column: df['Name']
- Multiple columns: df[['Name', 'Salary']]
- Rows: df.loc[0:2] (by label) or df.iloc[0:2] (by position)

Filtering: df[df['Age'] > 25] (people older than 25).
Math: df['Salary'].sum() (total salary).
Merge/Join: Combine two DataFrames (like SQL joins).pythonCopydf1.merge(df2, on=’ID’, how=’inner’) # Keeps matching rows only.

Class Notes: Data Handling Using Pandas – I