Transformers.js
Run Multimodal AI in the Browser with Transformers.js
A hands-on look at building browser-based multimodal AI with Transformers.js: image captioning and speech recognition that run entirely client-side, with no server or API calls required.