DEV Community


Posted on

Best way to validate content of the entire input data file is consistent!!

In machine learning, preparing data is one of the key step. Below is the simple and best way to check input data file is formatted properly. Below command should always return one unique value other wise file is not formatted properly.

cat file_name | awk -F',' '{print NF}' | sort -u

Refer original post for more details.

Top comments (0)