MBZUAI/videocom_training_dataset
Updated • 5
Natural Language Processing, Machine Learning, and Computer Vision
CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare
From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering