Discussion about this post

User's avatar
Suraj Pakala's avatar

Great read, valuable information about the real bottlenecks with egocentric data collection. I agree the long‑term goal should be to get as close as possible to well labeled, outcome‑linked egocentric data.

I’m not an expert in this space, but do we mostly write off unannotated egocentric footage that exists today? My intuition is that with newer training approaches (self-supervised learning, better VLMs, etc.), messy egocentric footage can be mined for patterns over time, especially in repetitive settings like kitchens and factories.

Sahil Khanna's avatar

first of all - fire read. I do think one big hurdle is missing in your argument about construction. You explained that trust and incentives are the main hurdle. However, I think the primary hurdle is a systemic issue with construction - the economic and structural features of the construction industry. Construction as an industry has brutally thin margins, is very dependent on contracted workers, and has a large undocumented workforce. This makes it that not collecting egocentric data is actually a very rational choice.

If they did collect data - 1. The compliance headache and costs for companies would increase massively, costs which fall very low on the priority list given that they already have thin margins. 2. The training and re-training costs would be super high for those who want their employees to capture good data (due to contracted employee churn). 3. It exposes undocumented workers to a risk of on-camera evidence. With that lens, I think breaking into the construction industry to collect egocentric data is much more than a trust issue, but more of an economic/structural issue. Let me know your thoughts.

2 more comments...

No posts

Ready for more?