From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models
VisualCOMET+: Visual Commonsense Generation & its incorporation into a Multimodal Topic Modeling algorithm