Frequently Asked Questions
- Can I use the ImageNet images or the ImageNet pretrained models?
No. The main goal of the WebVision challenge is to push the envelope of learning visual representations without human annotations, so the use of human-annotated data is strictly prohibited (text data is an exception). Therefore, ImageNet images and ImageNet-pretrained models may not be used in any form.
- Can I use external images without human annotations?
No. For fairness, the challenge is restricted to the WebVision training images only. You are not allowed to use other web image datasets such as YFCC100M, nor to crawl web images yourself.
- Can I use the text data (tags, description, caption) in the WebVision dataset?
Yes, and we encourage you to do so. It has been shown in the literature that such textual information can provide useful supervision for training models.
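As a concrete illustration of text metadata as weak supervision, the sketch below flags images whose tags or description mention the target concept, which could be used to filter noisy web labels. This is a toy example, not a prescribed method; the metadata layout (`tags`, `description` keys) is hypothetical.

```python
def matches_concept(metadata, concept):
    """Return True if the concept word appears in the image's tags or description.

    `metadata` is a hypothetical dict with optional "tags" (list of str)
    and "description" (str) fields, standing in for WebVision metadata.
    """
    text = " ".join(metadata.get("tags", []) + [metadata.get("description", "")])
    return concept.lower() in text.lower()

# Example: keep only samples whose text mentions the concept "cat".
samples = [
    {"tags": ["tabby", "cat", "pet"], "description": "my cat sleeping"},
    {"tags": ["car", "road"], "description": "highway traffic"},
]
kept = [s for s in samples if matches_concept(s, "cat")]
print(len(kept))  # → 1
```

In practice one would combine such signals with the model's own predictions rather than rely on exact string matching alone.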
- Can I use external text data, or models pretrained with external text data, with or without human annotation?
Yes, and we also encourage you to do so. This does not conflict with our goal of learning visual representations without human annotations. Therefore, WordNet, Knowledge Graph, etc. can be used. Models trained on external text data, such as Word2Vec or BERT, are also allowed. Note that the text data or models must be publicly available, and you should explicitly state in your final submission which text datasets/models are used.
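One common way such pretrained text models help is by relating concept names through embedding similarity. The sketch below uses tiny hand-made 3-d vectors as stand-ins for real pretrained embeddings (e.g. Word2Vec); the vectors and vocabulary are invented for illustration.

```python
import math

# Hypothetical 3-d word vectors standing in for pretrained embeddings;
# in practice these would be loaded from a public model such as Word2Vec.
vectors = {
    "cat":    [0.90, 0.10, 0.00],
    "kitten": [0.85, 0.20, 0.05],
    "car":    [0.00, 0.90, 0.40],
}

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Semantically related concepts score higher than unrelated ones.
print(cosine(vectors["cat"], vectors["kitten"]) > cosine(vectors["cat"], vectors["car"]))  # → True
```

Such similarities could, for example, group visually or semantically related WebVision concepts when cleaning noisy labels.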
- Can I crawl text data according to WebVision concepts by myself, and use it as training data?
Yes. There is no restriction on non-visual data, except that the data must be publicly available so that others can reproduce the results. If you crawl text data yourself, please state this clearly in your submission and make the data publicly available before the final submission deadline. A URL should be provided in the method description part of your submission.
If you have other questions, please drop an email to webvisionworkshop AT gmail.com