DEV Community πŸ‘©β€πŸ’»πŸ‘¨β€πŸ’»

Zaynaib (Ola) Giwa
Zaynaib (Ola) Giwa

Posted on • Updated on

Day 1 - 66 Days of Data Science

For the 66 days of data science journey, I am going back to the basics and relearning statistics. I am using a textbook titled Introduction to The Practice of Statistics Sixth Editionβ€Š-β€ŠMoore, McCabe Craig. Here is what I learned so far.

Data are numbers with context. Before you do any statistical calculation and create data visualizations you need to start with the habit of forming a question. "What does the data tell me?"
The starting point to any statistical analysis is to master the art of examining data.

Person Age Weight
Buttercup 24 110
Bubbles 24 105
Blossom 24 107

This is a table that contains data.

Individuals also known as cases, observations, and rows. If you are into programming you can think of rows as objects. Each object is like a noun that describes a (person, place, or thing). Objects have characteristics called variables.

Variables are also known as columns.

When you plan to do an exploratory data analysis(EDA) ask yourself the following questions.

  1. Why? Is there a specific question that I want to be answered by looking at this data? What is the purpose of this data?

  2. Who? What population does this data describe?

  3. What? How many columns does this data set have? How are these variables defined?

Top comments (0)

Build Anything...


Use any Linode offering to create something for the DEV x Linode Hackathon 2022. A variety of prizes are up for grabs, inculding $1,000 USD. πŸ‘€

β†’ Join the Hackathon <-