Modeling Social Interactions From Multimodal Signals