Efficient And Scalable Large Multimodal Models