DEV Community

Cover image for Notes on Data Science Fundamentals
kainat Raisa
kainat Raisa

Posted on

Notes on Data Science Fundamentals

Why Data Science?
Huge amount of data is produced everyday from virtual platforms, industrial zones, corporate offices, Scientific experiments etc. All of these data needs to be managed, organized, analyzed properly to be used for making better strategies for our businesses, organizations, research work etc. Data is a core part of the development as well as the revolution of 21st century. So we need a whole field where we will work,understand, discover with data and from data. This field of work and study is know as Data Science.

What is Data Science?
The science of getting insights from data(usually huge amount of data) by collecting, organizing summarizing, and visualizing them for better decision making is called Data Science.

Some Tools we use for Data Science :-

▪️Databases (MySQL, MongoDB etc.) : Used for storing data in a structured way.
▪️Spreadsheet softwares (MS Excel, Google Drive Spreadsheet etc.) : Used to prepare Data for analysis.
▪️ Data analysis tools (Python, R, MATLAB etc.) : Used for structure, analyze, visualize data.
▪️ Programming libraries and modules (Matplotlib, seaborn, plotly, Numpy,Pandas etc.) : Used for performing different operations on the Data.
▪️ Apache Spark : Used for big data loads.
▪️ Virtual Environments ( Jupyter Notebook, VSCode etc.) : Used to perform the operations on data using programming languages.

(There are many more tools which are widely used in the field of Data Science)

Data Science methodology( steps we follow for Data Science) :-
▪️Business Understanding
▪️Analytic Approach
▪️Data requirements
▪️Data Collection
▪️Data understanding
▪️Data preparation
▪️Modeling
▪️Evaluation
▪️Deployment
▪️Feedback

(All of the concepts mentioned above will be broadly explained in another blog)

Top comments (0)