Fireworks 
FireLLaVA 13B
fireworks/firellava-13b
A fast vision-language model, FireLLaVA understands both text and images. It shows strong chat performance in tests and was designed to mimic the multimodal capabilities of GPT-4.
It is the first commercially permissive open-source LLaVA model, trained entirely on instruction-following data generated by open-source LLMs.
Context Window
4,096 tokens