DEV Community

Discussion on: Efficiently Streaming a Large AWS S3 File via S3 Select

Idris Rampurawala

Hi,
Glad you liked the post and that it helped with your use case.
With this streaming approach, you have to keep retrieving file chunks from S3 until you reach the total file size. I recommend cloning this repo and comparing it with your local code to see if you missed something 😉

You might also want to check out the sequel to this post, which covers parallel processing 😁

Vinícius

Thank you! I found what I was missing: I had set start_byte = end_byte + 1, which was losing one row per chunk. Your next article was exactly what I needed for the next step of my program.
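
For anyone hitting the same off-by-one, here is a minimal sketch of the chunk-boundary loop. It assumes the fix implied above, that each chunk starts at the previous end_byte rather than end_byte + 1. The function name `scan_ranges` and the parameter `chunk_bytes` are hypothetical and not taken from the article or the repo.

```python
from typing import Iterator, Tuple


def scan_ranges(file_size: int, chunk_bytes: int) -> Iterator[Tuple[int, int]]:
    """Yield (start_byte, end_byte) pairs until the total file size is reached.

    Hypothetical helper: the names and signature are illustrative only.
    """
    if chunk_bytes <= 0:
        raise ValueError("chunk_bytes must be positive")
    start_byte = 0
    while start_byte < file_size:
        # Never let the end of a chunk run past the end of the file.
        end_byte = min(start_byte + chunk_bytes, file_size)
        yield start_byte, end_byte
        # The next chunk starts where this one ended. Using end_byte + 1
        # here is the mistake described in this thread.
        start_byte = end_byte


if __name__ == "__main__":
    for start_byte, end_byte in scan_ranges(file_size=12, chunk_bytes=5):
        print(start_byte, end_byte)
```

Each pair would presumably be passed as the Start and End of the ScanRange parameter in the S3 Select request, one request per chunk.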