DEV Community

loading...

Introduction to Data Science

sreepotluri profile image Pushpa Sree Potluri ・1 min read

Most of the people are under the misconception that data science is all about machine learning algorithms. That is not true. Data Science is a combination of mathematics, computer science and, machine learning.

Alt Text

Data Science is a study of data, where you maintain datsets and derive insights from the dataset. Data Science uses different parts mentioned in the pattern below to solve the problems.

Alt Text

Perception - try to identify patterns with the help of the data
Planning - involves two steps:

  1. Finding all possible solutions
  2. Finding the best possible solution among all solutions

What do you need to know to be a successful data scientist?

  1. Programming Knowledge
  2. Data modelling and evaluation
  3. Data Visualization and reporting
  4. Probability and Statistics
  5. Machine Learning techniques
  6. Relational Database knowledge

Let's get started with some basic terminology used in data science:

  1. Observations - data points in your dataset (rows)
  2. Features - variables in your dataset (columns)
  3. Target Variable - which you are trying to predict
  4. Train data - data from which your algorithm learns
  5. Test data - data to evaluate your model performance
  6. Model - set of patterns learned from the data
  7. Algorithm - specific machine learning process used to train your model

Discussion

pic
Editor guide