You might have had better results with full-body images (assuming you could scrape nearly as many of them). It might just be a US thing, but clothing choices are usually a lot more indicative of tastes, which, in turn, can influence areas of academic interest.


Yes, I agree! With full-body images, a model might even learn things like posture. There would be a lot more information to extract knowledge from. But you would probably need a different approach to gather enough training data.

