VFF - The signal in the noise

Data & Training

Datasets, synthetic data, training pipelines, and data governance for ML

New Multilingual Medical AI Benchmark Reveals Language and Vision Gaps

Researchers have developed EuropeMedQA, a multilingual and multimodal medical examination dataset drawn from official…

by Francesco Andrea Causio, Vittorio De Vita, Olivia Riccomi, Michele Ferramola, Federico Felizzi, Alessandro Tosi, Antonio Cristiano, Lorenzo De Mori, Chiara Battipaglia, Melissa Sawaya, Luigi De Angelis, Marcello Di Pumpo, Alessandra Piscitelli, Pietro Eric Risuleo, Alessia Longo, Giulia Vojvodic, Mariapia Vassalli, Bianca Destro Castaniti, Nicolò Scarsi, Manuel Del Medico · about 1 month ago · ArXiv (cs.AI)

Web Video as Training Data for 3D Scene Understanding

Researchers demonstrate that unlabeled internet videos can be automatically processed into training data for 3D scene…

by Yixin Chen, Yaowei Zhang, Huangyue Yu, Junchao He, Yan Wang, Jiangyong Huang, Hongyu Shen, Junfeng Ni, Shaofei Wang, Baoxiong Jia, Song-Chun Zhu, Siyuan Huang · about 1 month ago · ArXiv (cs.AI)