vff — the signal in the noise

Multimodal

Vision-language models, audio AI, and cross-modal capabilities