Pandas is a popular data manipulation and analysis library in Python. It offers data structures and functions needed to work with structured data seamlessly.
Installing Pandas
Install Pandas using pip:
pip install pandas
Importing Pandas
Import the library into your script:
import pandas as pd
Creating DataFrames
Create a DataFrame from a dictionary:
data = {
'Name': ['Alice', 'Bob', 'Charlie'],
'Age': [25, 30, 35]
}
df = pd.DataFrame(data)
print(df)
Reading Data from a File
Read data from a CSV file:
df = pd.read_csv('data.csv')
Data Exploration
Get a summary of your data:
print(df.head())
print(df.describe())
Data Manipulation
Filter data based on a condition:
filtered_df = df[df['Age'] > 28]
Conclusion
Pandas is an essential tool for data scientists and analysts working in Python. It simplifies the process of data manipulation, allowing you to focus on extracting insights.