Best way to validate content of the entire input data file is consistent!!

github logo ・1 min read

In machine learning, preparing data is one of the key step. Below is the simple and best way to check input data file is formatted properly. Below command should always return one unique value other wise file is not formatted properly.

cat file_name | awk -F',' '{print NF}' | sort -u

Refer original post for more details.

twitter logo DISCUSS
Classic DEV Post from Apr 3

Workspace Wednesday: Show off your desk/computer setup!

Let’s share pics of everybody’s setups. Feel free to add details about your ha...

chanduthedev profile image
I am a developer

Be a better developer. Free forever.