I'm not sure about networks learning the pre-processing, as most of the image pipelines I have seen involve a lot of hit-and-miss experimentation with pre-processing steps. I have seen people try thresholding images, using gradient or edge images, and using RGB vs. grayscale vs. HSL. I think there is a lot of variability in pre-processing, which makes it difficult for a network to learn. This is one case where knowledge of your specific dataset and some knowledge of computer vision helps; otherwise, we will require a very large number of training images. If we have a small number of images, we can 'augment' the dataset by using image data generators, which slightly change images by rotating/resizing/blurring/distorting them, etc.
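To make the augmentation idea concrete, here is a minimal sketch in plain NumPy. It is a toy stand-in for library tools such as Keras's ImageDataGenerator (the function name `augment` and all parameters here are my own, not from any library): random flips, 90-degree rotations, and mild noise, used to grow a tiny dataset.

```python
import numpy as np

def augment(image, rng):
    """Return a randomly perturbed copy of a (H, W) grayscale image.

    Toy augmentations: horizontal flip, 90-degree rotation, Gaussian noise.
    """
    out = image.copy()
    if rng.random() < 0.5:
        out = np.fliplr(out)                        # random horizontal flip
    out = np.rot90(out, k=int(rng.integers(0, 4)))  # random 90-degree rotation
    out = out + rng.normal(0.0, 0.01, out.shape)    # mild additive noise
    return np.clip(out, 0.0, 1.0)                   # keep valid pixel range

# Grow one image into ten slightly different training samples.
rng = np.random.default_rng(0)
base = rng.random((8, 8))
augmented = [augment(base, rng) for _ in range(10)]
```

In a real pipeline you would of course use rotations at arbitrary angles, rescaling, and blurring as well, but the principle is the same: cheap label-preserving perturbations in place of more data.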
There are also LSTM networks, which have a concept of memory, but they are used more for speech recognition and time series. I haven't worked with these yet.
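The "memory" part can be sketched in a few lines. This is a minimal single LSTM cell in NumPy (my own illustrative code, not any particular library's implementation): the cell state `c` is carried across time steps, and learned gates decide what to forget, what to write, and what to expose as output.

```python
import numpy as np

def lstm_step(x, h, c, W, U, b):
    """One LSTM step. x: input, h: hidden state, c: cell state (the 'memory').

    W, U, b hold the four gates' parameters stacked in the order:
    input gate, forget gate, candidate update, output gate.
    """
    z = W @ x + U @ h + b
    n = h.size
    i = 1.0 / (1.0 + np.exp(-z[:n]))        # input gate: how much to write
    f = 1.0 / (1.0 + np.exp(-z[n:2*n]))     # forget gate: how much to keep
    g = np.tanh(z[2*n:3*n])                 # candidate cell update
    o = 1.0 / (1.0 + np.exp(-z[3*n:]))      # output gate: how much to expose
    c_new = f * c + i * g                   # memory carried to the next step
    h_new = o * np.tanh(c_new)
    return h_new, c_new

# Run a short random sequence; c accumulates information over time.
rng = np.random.default_rng(1)
n_in, n_hid = 3, 4
W = rng.normal(0.0, 0.1, (4 * n_hid, n_in))
U = rng.normal(0.0, 0.1, (4 * n_hid, n_hid))
b = np.zeros(4 * n_hid)
h, c = np.zeros(n_hid), np.zeros(n_hid)
for t in range(5):
    h, c = lstm_step(rng.normal(size=n_in), h, c, W, U, b)
```

The gating is exactly why LSTMs suit speech and time series: unlike a plain feed-forward net, the state `c` lets information from early steps influence much later outputs.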
Got LSTMs on my to-do list already. Definitely going to check them out!