DEV Community

Pushpa Sree Potluri

Posted on • Originally published at hackerheap.com

What do you need to know to get into Data Science as a Beginner?

Data science combines programming and statistics, so to become a data scientist you need to know at least one programming language. Python and R are good choices because both have large communities that use them to build models.

For a complete beginner, Python is easy to learn. Some of the basic tools used in data science from the Python stack are:

Jupyter Notebook - interactive environment for writing and running code
pandas - library for data manipulation and analysis
NumPy - library for scientific computing
Matplotlib & Seaborn - libraries for data visualization
scikit-learn - library for machine learning
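
To give a feel for how two of these tools fit together, here is a minimal sketch. The data (hours studied and exam scores) is made up for illustration and is not from the original post. NumPy does the numeric work and pandas keeps the result in a labeled table.

```python
import numpy as np
import pandas as pd

# Hypothetical data: hours studied and exam scores for five students.
hours = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
scores = np.array([52.0, 58.0, 65.0, 71.0, 78.0])

# NumPy handles the numeric work: fit a straight line
# score = slope * hours + intercept by least squares.
slope, intercept = np.polyfit(hours, scores, deg=1)

# pandas holds the data as a labeled table and stores
# the fitted values as an extra column.
df = pd.DataFrame({"hours": hours, "score": scores})
df["fitted"] = slope * df["hours"] + intercept

print(df)
print(f"slope={slope:.2f}, intercept={intercept:.2f}")
```

In a Jupyter notebook you could run this in a single cell and then plot `df` with Matplotlib or Seaborn in the next one.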

Good mathematical knowledge helps you make better judgments when choosing a procedure (algorithm) for the data available to you, and it also helps you diagnose problems when the results look wrong.
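
As a small illustration of why the statistics matter, the sketch below compares the mean and the median on a made-up list of salaries with one extreme value. The numbers are hypothetical; the point is only that knowing how each statistic reacts to outliers helps you pick the right summary and spot a problem in the data.

```python
import statistics

# Hypothetical monthly salaries; the last value is an extreme outlier.
salaries = [3000, 3200, 3100, 2900, 3300, 50000]

# The mean is pulled far upward by the single outlier.
mean_salary = statistics.mean(salaries)

# The median stays close to what a typical person in the list earns.
median_salary = statistics.median(salaries)

print(f"mean:   {mean_salary:.0f}")
print(f"median: {median_salary:.0f}")
```

A large gap between the two is a hint that the data is skewed or contains outliers, which in turn affects which models and error measures are sensible.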


If you don't have time to go through the theory, start with a tutorial and follow it step by step. After you complete it, apply what you learned to new datasets. You can find sample datasets online (https://www.kaggle.com/datasets). When you try the same modeling approach on a new dataset, you might run into new issues. With some research, you might discover problems in the data itself, such as inconsistent formats or missing values.
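
Here is a minimal sketch of how you might check a new dataset for those two issues with pandas. The tiny CSV, its column names, and the choice of filling missing ages with the median are all assumptions made for illustration; a real dataset will need its own checks.

```python
import io
import pandas as pd

# Hypothetical CSV with two common problems:
# missing values and mixed date formats.
raw = io.StringIO(
    "id,signup_date,age\n"
    "1,2021-01-15,34\n"
    "2,15/02/2021,\n"
    "3,2021-03-20,29\n"
    "4,,41\n"
)
df = pd.read_csv(raw)

# Count missing values per column.
missing = df.isna().sum()
print(missing)

# Parse only the ISO format (YYYY-MM-DD). Anything that does not
# match becomes NaT, so rows in a different format can be flagged.
parsed = pd.to_datetime(df["signup_date"], format="%Y-%m-%d", errors="coerce")
bad_format = df.loc[parsed.isna() & df["signup_date"].notna(), "signup_date"]
print(bad_format.tolist())

# One simple (not always appropriate) fix: fill missing ages with the median.
df["age"] = df["age"].fillna(df["age"].median())
```

Whether to fill, drop, or leave missing values depends on the dataset and the model, so treat the last line as one option rather than a rule.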

If you are looking for more resources, https://www.coursera.org/ and https://www.datacamp.com/ offer some good, free courses.

This blog was first posted on hackerheap.com

Top comments (8)

SIMON DODSON

Without formal training, look at the metrics that matter most to your task/EBITA/time, and find a way to improve them. Utilise the tools that are accessible to you; Google Sheets is fine. If you can learn to move business units, you're valuable ...

This is a fun, fast, and easy way to get into data metrics.

Saeed Ahmad

Pushpa, I have completed data science courses from DataCamp and IBM, but because I am an electronics grad, I am unable to find a data science job. What do you suggest?

Pushpa Sree Potluri

Focus on showing your skills to employers. Build a portfolio of real-world projects to prove your competency. Hope it helps!

Saeed Ahmad

Thanks.

Jennifer Bradford

Do you know of any degree programs that teach the skills needed to get into data science?

Dom

Check out Coursera or edX. Both offer online master's degrees in data science from top universities, without requiring a CS background.

Pushpa Sree Potluri

I'm not sure of any degree programs that you can attend, but Udacity has some great nanodegree programs (udacity.com/school-of-data-science) if you like! I hope it helps!

Manda Putra • Edited

"diagnose the problem"

what kind of problem? Do data science encounterm