DEV Community

Discussion on: TensorFlow to filter PDF files

lukaszkuczynski • Edited

I think it's a great idea to strip the text out of the PDFs first.
Some time ago I was thinking about how to use Elasticsearch for this.
There are easy ways to index PDF documents, and then you can use similarity or relevance scoring to search them.
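
To make that a bit more concrete, here is a rough sketch of the request bodies such a setup might use. It assumes the Elasticsearch ingest attachment processor is available on the cluster (it extracts text from base64-encoded files into `attachment.content`) and uses a `more_like_this` query for the similarity part. The pipeline name, index name, and helper functions are made up for illustration, and nothing here talks to a real cluster; it only builds the JSON you would send.

```python
import base64
import json

# Hypothetical names, chosen only for this sketch.
PIPELINE_ID = "pdf_attachment"
INDEX_NAME = "pdf_docs"


def attachment_pipeline_body() -> dict:
    """Body for PUT _ingest/pipeline/<PIPELINE_ID>.

    The attachment processor reads the base64 string in the "data" field
    and stores the extracted text under "attachment.content".
    """
    return {
        "description": "Extract text from base64-encoded PDF files",
        "processors": [{"attachment": {"field": "data"}}],
    }


def pdf_document_body(pdf_bytes: bytes, filename: str) -> dict:
    """Body for PUT <INDEX_NAME>/_doc/<id>?pipeline=<PIPELINE_ID>."""
    return {
        "filename": filename,
        "data": base64.b64encode(pdf_bytes).decode("ascii"),
    }


def similar_documents_query(sample_text: str, size: int = 5) -> dict:
    """Body for POST <INDEX_NAME>/_search.

    more_like_this ranks indexed documents by how similar their extracted
    text is to sample_text; each hit comes back with a relevance _score.
    """
    return {
        "size": size,
        "query": {
            "more_like_this": {
                "fields": ["attachment.content"],
                "like": sample_text,
                "min_term_freq": 1,
                "min_doc_freq": 1,
            }
        },
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real PDF file read from disk.
    doc = pdf_document_body(b"%PDF-1.4 placeholder", "example.pdf")
    print(json.dumps(attachment_pipeline_body(), indent=2))
    print(json.dumps(doc, indent=2))
    print(json.dumps(similar_documents_query("quarterly invoice"), indent=2))
```

You would send these bodies with whatever HTTP or Elasticsearch client you prefer. Lowering `min_term_freq` and `min_doc_freq` to 1 is only there so that small test collections return hits; on a real corpus the defaults may work better.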