HyperAIHyperAI超神经
首页资讯最新论文教程数据集百科SOTALLM 模型天梯GPU 天梯顶会
全站搜索
关于
中文
HyperAIHyperAI超神经
  1. 首页
  2. SOTA
  3. 零样本视频问答
  4. Zero Shot Video Question Answer On Egoschema

Zero Shot Video Question Answer On Egoschema

评估指标

Accuracy

评测结果

各个模型在此基准测试上的表现结果

模型名称
Accuracy
Paper TitleRepository
VideoChat2_HD_mistral65.6MVBench: A Comprehensive Multi-modal Video Understanding Benchmark-
MVU (13B)60.3Understanding Long Videos with Multimodal Language Models-
Random20.0--
LangRepo (12B)66.2Language Repository for Long Video Understanding-
LLoVi (7B)50.8A Simple LLM Framework for Long-Range Video Question-Answering-
SlowFast-LLaVA-34B47.2SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models-
LLoVi (GPT-3.5)57.6A Simple LLM Framework for Long-Range Video Question-Answering-
Tarsier (34B)68.6Tarsier: Recipes for Training and Evaluating Large Video Description Models-
SeViLA (4B)25.7Self-Chained Image-Language Model for Video Localization and Question Answering-
LVNet66.0Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA-
TS-LLaVA-34B57.8TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models-
VideoTree (GPT4)66.2VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos-
VideoChat2_mistral63.6MVBench: A Comprehensive Multi-modal Video Understanding Benchmark-
0 of 13 row(s) selected.
HyperAI

学习、理解、实践,与社区一起构建人工智能的未来

中文

关于

关于我们数据集帮助

产品

资讯教程数据集百科

链接

TVM 中文Apache TVMOpenBayes

© HyperAI超神经

津ICP备17010941号-1京公网安备11010502038810号京公网安备11010502038810号
TwitterBilibili