I am really curious about the data labelling. Do you mean that you took photos o...

I am really curious about the data labelling. Do you mean that you took photos of piles of Legos and labeled them by hand. Or individual Legos from different angles?

Also you mentioned data synthesis. How would this be possible? Unless your suggesting that you rendered photo realistic piles of Legos and used trained on them because if that is the case, please do a write up of the project. I can't imagine more interesting way to generate training data.