-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 152 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
Collections
Discover the best community collections!
Collections including paper arxiv:2510.09558
-
Seedream 4.0: Toward Next-generation Multimodal Image Generation
Paper • 2509.20427 • Published • 82 -
Tree Search for LLM Agent Reinforcement Learning
Paper • 2509.21240 • Published • 92 -
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Paper • 2510.06917 • Published • 35 -
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 129
-
meta-llama/CodeLlama-7b-Instruct-hf
Text Generation • 7B • Updated • 4.18k • 59 -
hamzab/roberta-fake-news-classification
Text Classification • Updated • 2.59k • • 9 -
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
Paper • 2509.06917 • Published • 43 -
Hulu-Med: A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
Paper • 2510.08668 • Published • 9
-
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models
Paper • 2503.09567 • Published • 1 -
AutoPR: Let's Automate Your Academic Promotion!
Paper • 2510.09558 • Published • 53 -
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning
Paper • 2601.06002 • Published • 53
-
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
Paper • 2505.10554 • Published • 120 -
Chain-of-Model Learning for Language Model
Paper • 2505.11820 • Published • 121 -
On Path to Multimodal Generalist: General-Level and General-Bench
Paper • 2505.04620 • Published • 82 -
Causal-Copilot: An Autonomous Causal Analysis Agent
Paper • 2504.13263 • Published • 7
-
LoRA+: Efficient Low Rank Adaptation of Large Models
Paper • 2402.12354 • Published • 7 -
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper • 2402.12659 • Published • 23 -
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Paper • 2402.13249 • Published • 15 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 69
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 152 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models
Paper • 2503.09567 • Published • 1 -
AutoPR: Let's Automate Your Academic Promotion!
Paper • 2510.09558 • Published • 53 -
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning
Paper • 2601.06002 • Published • 53
-
Seedream 4.0: Toward Next-generation Multimodal Image Generation
Paper • 2509.20427 • Published • 82 -
Tree Search for LLM Agent Reinforcement Learning
Paper • 2509.21240 • Published • 92 -
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Paper • 2510.06917 • Published • 35 -
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 129
-
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
Paper • 2505.10554 • Published • 120 -
Chain-of-Model Learning for Language Model
Paper • 2505.11820 • Published • 121 -
On Path to Multimodal Generalist: General-Level and General-Bench
Paper • 2505.04620 • Published • 82 -
Causal-Copilot: An Autonomous Causal Analysis Agent
Paper • 2504.13263 • Published • 7
-
meta-llama/CodeLlama-7b-Instruct-hf
Text Generation • 7B • Updated • 4.18k • 59 -
hamzab/roberta-fake-news-classification
Text Classification • Updated • 2.59k • • 9 -
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
Paper • 2509.06917 • Published • 43 -
Hulu-Med: A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
Paper • 2510.08668 • Published • 9
-
LoRA+: Efficient Low Rank Adaptation of Large Models
Paper • 2402.12354 • Published • 7 -
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper • 2402.12659 • Published • 23 -
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Paper • 2402.13249 • Published • 15 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 69