Return to Article Details Commonsense-based Visual-Linguistic Reasoning for Document Filtering using Multimodal Large Language Models Download Download PDF