Anderson Braz

Posted on Apr 14, 2021 • Originally published at andersonbraz.com on Jul 28, 2020

Data Science in Python: Pandas Read Sources

#pythonbeginner #datascience #collaboration #pandas

In this post I show basic knowledge and notes for data science beginners. You will find in this post an link to Jupyter file with code and execution.

Pandas Basics

Pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.

Use the following import convention:

import pandas as pd

Important

Here I continue the content of the previous post Data Science in Python: Pandas Introduction

This post I consider three sources: CSV, XLSX and SQL Query

Read and Write CSV

pd.read_csv('origin-file.csv', header=None, nrows=5)
pd.to_csv('destin-file.csv')

Read and Write Excel

pd.read_excel('origin-sheet.xlsx')
pd.to_excel('destin-sheet.xlsx', sheet_name='Sheet1')

Read and Write to SQL Query or Database Table

from sqlahchemy import create_engine
engine = create_engine('sqlite:///:memory:')

pd.read_sql('SELECT * FROM my_table;', engine)
pd.read_sql_table('my_table', engine)
pd.read_sql_query('SELECT * FROM my_table;', engine)

Conclusion

Pandas is flexible and easy to use analysis and manipulation data with external sources.

DEV Community

Data Science in Python: Pandas Read Sources

Pandas Basics

Important

Read and Write CSV

Read and Write Excel

Read and Write to SQL Query or Database Table

Conclusion

See on Practice - Code and Execution

Credits

Top comments (0)

Read next

Announcing FiftyOne 0.23.7 and FiftyOne Teams 1.5.8

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

CodecLM: Aligning Language Models with Tailored Synthetic Data

ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past