Discussion on: Regression from scratch - Wine quality prediction

View post

Eric Alcaraz del Pico • May 28 '19

I have a problem:
correlations = df.corr()['quality'].drop('quality')
Keyerror : 'quality'
Some idea?

Govind Bisht • Aug 5 '19

Hey Eric,

Please pass the separator value while reading the CSV file.

ex:
df = pd.read_csv('winequality-red.csv', sep=";")

Apoorva Dave • Jun 2 '19

The dataframe into which you have read csv file should contain the column 'quality'.
correlations = df.corr()['quality'].drop('quality')
Here we are trying to find correlations between column quality and all the other columns other than quality. 'quality' is our target variable.

Eric Alcaraz del Pico • Jun 3 '19 • Edited

This my csv

I have this column named 'quality'.
This is my code:

And i am having the problem:

Help pls :,(

Apoorva Dave • Jun 4 '19

I see you have value for 'quality' column in the dataset but it is not being read properly. As you can see in the output row 2 and 3 are showing .... but values are present in the actual dataset. Can you try printing df['quality'] and see are there are blank values for it?

Govind Bisht • Aug 5 '19

It is reading all the columns as one column. You need to pass the separator while reading the CSV file.

ex:
df = pd.read_csv('winequality-red.csv', sep=";")