ずんだもんのHugging Faceニュース
Daily AI Papers Briefing (2026-03-21)
【本日の論文】
1. Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding
https://huggingface.co/papers/2603.19235
2. SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
https://huggingface.co/papers/2603.19228
3. FASTER: Rethinking Real-Time Flow VLAs
https://huggingface.co/papers/2603.19199
4. 3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model
https://huggingface.co/papers/2603.18524
5. Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer
https://huggingface.co/papers/2603.19227
Daily AI Papers Briefing (2026-03-20)
【本日の論文】
1. MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
https://huggingface.co/papers/2603.17187
2. Video-CoE: Reinforcing Video Event Prediction via Chain of Events
https://huggingface.co/papers/2603.14935
3. MosaicMem: Hybrid Spatial Memory for Controllable Video World Models
https://huggingface.co/papers/2603.17117
4. Alignment Makes Language Models Normative, Not Descriptive
https://huggingface.co/papers/2603.17218
5. Complementary Reinforcement Learning
https://huggingface.co/papers/2603.17621
Daily AI Papers Briefing (2026-03-19)
【本日の論文】
1. InCoder-32B: Code Foundation Model for Industrial Scenarios
https://huggingface.co/papers/2603.16790
2. MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification
https://huggingface.co/papers/2603.15726
3. Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
https://huggingface.co/papers/2603.13398
4. Kinema4D: Kinematic 4D World Modeling for Spatiotemporal Embodied Simulation
https://huggingface.co/papers/2603.16669
5. Demystifing Video Reasoning
https://huggingface.co/papers/2603.16870
Daily AI Papers Briefing (2026-03-18)
【本日の論文】
1. AI Can Learn Scientific Taste
https://huggingface.co/papers/2603.14473
2. OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
https://huggingface.co/papers/2603.15594
3. EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings
https://huggingface.co/papers/2603.13594
4. HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions
https://huggingface.co/papers/2603.15612
5. Grounding World Simulation Models in a Real-World Metropolis
https://huggingface.co/papers/2603.15583
Daily AI Papers Briefing (2026-03-17)
【本日の論文】
1. LMEB: Long-horizon Memory Embedding Benchmark
https://huggingface.co/papers/2603.12572
2. Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation
https://huggingface.co/papers/2603.12793
3. Can Vision-Language Models Solve the Shell Game?
https://huggingface.co/papers/2603.08436
4. daVinci-Env: Open SWE Environment Synthesis at Scale
https://huggingface.co/papers/2603.13023
5. OmniForcing: Unleashing Real-time Joint Audio-Visual Generation
https://huggingface.co/papers/2603.11647
Daily AI Papers Briefing (2026-03-16)
【本日の論文】
1. Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
https://huggingface.co/papers/2603.12255
2. Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
https://huggingface.co/papers/2603.12180
3. IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse
https://huggingface.co/papers/2603.12201
4. Video-Based Reward Modeling for Computer-Use Agents
https://huggingface.co/papers/2603.10178
5. ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation
https://huggingface.co/papers/2603.11421
Daily AI Papers Briefing (2026-03-15)
【本日の論文】
1. Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
https://huggingface.co/papers/2603.12255
2. Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
https://huggingface.co/papers/2603.12180
3. IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse
https://huggingface.co/papers/2603.12201
4. Video-Based Reward Modeling for Computer-Use Agents
https://huggingface.co/papers/2603.10178
5. ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation
https://huggingface.co/papers/2603.11421
Daily AI Papers Briefing (2026-03-14)
【本日の論文】
1. Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
https://huggingface.co/papers/2603.12255
2. Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
https://huggingface.co/papers/2603.12180
3. IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse
https://huggingface.co/papers/2603.12201
4. Video-Based Reward Modeling for Computer-Use Agents
https://huggingface.co/papers/2603.10178
5. DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
https://huggingface.co/papers/2603.12257
Daily AI Papers Briefing (2026-03-13)
【本日の論文】
1. OpenClaw-RL: Train Any Agent Simply by Talking
https://huggingface.co/papers/2603.10165
2. Flash-KMeans: Fast and Memory-Efficient Exact K-Means
https://huggingface.co/papers/2603.09229
3. MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents
https://huggingface.co/papers/2603.09827
4. LLM2Vec-Gen: Generative Embeddings from Large Language Models
https://huggingface.co/papers/2603.10913
5. ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning
https://huggingface.co/papers/2603.10160
Daily AI Papers Briefing (2026-03-12)
【本日の論文】
1. Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing
https://huggingface.co/papers/2603.03143
2. Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs
https://huggingface.co/papers/2603.09906
3. Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
https://huggingface.co/papers/2603.06577
4. MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
https://huggingface.co/papers/2603.09206
5. InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
https://huggingface.co/papers/2603.09877
Daily AI Papers Briefing (2026-03-11)
【本日の論文】
1. Lost in Stories: Consistency Bugs in Long Story Generation by LLMs
https://huggingface.co/papers/2603.05890
2. Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence
https://huggingface.co/papers/2603.07660
3. LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory
https://huggingface.co/papers/2603.03269
4. Believe Your Model: Distribution-Guided Confidence Calibration
https://huggingface.co/papers/2603.03872
5. How Far Can Unsupervised RLVR Scale LLM Training?
https://huggingface.co/papers/2603.08660
Daily AI Papers Briefing (2026-03-10)
【本日の論文】
1. Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
https://huggingface.co/papers/2603.06569
2. BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
https://huggingface.co/papers/2603.04918
3. Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model
https://huggingface.co/papers/2603.05438
4. WildActor: Unconstrained Identity-Preserving Video Generation
https://huggingface.co/papers/2603.00586
5. Progressive Residual Warmup for Language Model Pretraining
https://huggingface.co/papers/2603.05369
Daily AI Papers Briefing (2026-03-09)
【本日の論文】
1. MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier
https://huggingface.co/papers/2603.03756
2. SkillNet: Create, Evaluate, and Connect AI Skills
https://huggingface.co/papers/2603.04448
3. DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval
https://huggingface.co/papers/2603.04743
4. AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
https://huggingface.co/papers/2602.23166
5. RoboPocket: Improve Robot Policies Instantly with Your Phone
https://huggingface.co/papers/2603.05504
Daily AI Papers Briefing (2026-03-08)
【本日の論文】
1. MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier
https://huggingface.co/papers/2603.03756
2. SkillNet: Create, Evaluate, and Connect AI Skills
https://huggingface.co/papers/2603.04448
3. DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval
https://huggingface.co/papers/2603.04743
4. AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
https://huggingface.co/papers/2602.23166
5. RoboPocket: Improve Robot Policies Instantly with Your Phone
https://huggingface.co/papers/2603.05504
Daily AI Papers Briefing (2026-03-07)
【本日の論文】
1. MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier
https://huggingface.co/papers/2603.03756
2. SkillNet: Create, Evaluate, and Connect AI Skills
https://huggingface.co/papers/2603.04448
3. DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval
https://huggingface.co/papers/2603.04743
4. RoboPocket: Improve Robot Policies Instantly with Your Phone
https://huggingface.co/papers/2603.05504
5. AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
https://huggingface.co/papers/2602.23166
Daily AI Papers Briefing (2026-03-06)
【本日の論文】
1. Helios: Real Real-Time Long Video Generation Model
https://huggingface.co/papers/2603.04379
2. T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning
https://huggingface.co/papers/2603.03790
3. Heterogeneous Agent Collaborative Reinforcement Learning
https://huggingface.co/papers/2603.02604
4. Proact-VL: A Proactive VideoLLM for Real-Time AI Companions
https://huggingface.co/papers/2603.03447
5. MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning
https://huggingface.co/papers/2603.03379
Daily AI Papers Briefing (2026-03-05)
【本日の論文】
1. Utonia: Toward One Encoder for All Point Clouds
https://huggingface.co/papers/2603.03283
2. UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?
https://huggingface.co/papers/2603.03241
3. Beyond Language Modeling: An Exploration of Multimodal Pretraining
https://huggingface.co/papers/2603.03276
4. BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?
https://huggingface.co/papers/2603.03194
5. Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models
https://huggingface.co/papers/2603.01571
Daily AI Papers Briefing (2026-03-04)
【本日の論文】
1. From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
https://huggingface.co/papers/2603.00141
2. OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens
https://huggingface.co/papers/2603.02138
3. SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale
https://huggingface.co/papers/2602.23866
4. RubricBench: Aligning Model-Generated Rubrics with Human Standards
https://huggingface.co/papers/2603.01562
5. OpenAutoNLU: Open Source AutoML Library for NLU
https://huggingface.co/papers/2603.01824
Daily AI Papers Briefing (2026-03-03)
【本日の論文】
1. dLLM: Simple Diffusion Language Modeling
https://huggingface.co/papers/2602.22661
2. Enhancing Spatial Understanding in Image Generation via Reward Modeling
https://huggingface.co/papers/2602.24233
3. Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets
https://huggingface.co/papers/2602.22207
4. CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
https://huggingface.co/papers/2602.24286
5. Mode Seeking meets Mean Seeking for Fast Long Video Generation
https://huggingface.co/papers/2602.24289
Daily AI Papers Briefing (2026-03-02)
【本日の論文】
1. The Trinity of Consistency as a Defining Principle for General World Models
https://huggingface.co/papers/2602.23152
2. From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
https://huggingface.co/papers/2602.22859
3. MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
https://huggingface.co/papers/2602.22638
4. OmniGAIA: Towards Native Omni-Modal AI Agents
https://huggingface.co/papers/2602.22897
5. Imagination Helps Visual Reasoning, But Not Yet in Latent Space
https://huggingface.co/papers/2602.22766
Daily AI Papers Briefing (2026-03-01)
【本日の論文】
1. The Trinity of Consistency as a Defining Principle for General World Models
https://huggingface.co/papers/2602.23152
2. From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
https://huggingface.co/papers/2602.22859
3. MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
https://huggingface.co/papers/2602.22638
4. OmniGAIA: Towards Native Omni-Modal AI Agents
https://huggingface.co/papers/2602.22897
5. Imagination Helps Visual Reasoning, But Not Yet in Latent Space
https://huggingface.co/papers/2602.22766
Daily AI Papers Briefing (2026-02-28)
【本日の論文】
1. The Trinity of Consistency as a Defining Principle for General World Models
https://huggingface.co/papers/2602.23152
2. From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
https://huggingface.co/papers/2602.22859
3. MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
https://huggingface.co/papers/2602.22638
4. OmniGAIA: Towards Native Omni-Modal AI Agents
https://huggingface.co/papers/2602.22897
5. Imagination Helps Visual Reasoning, But Not Yet in Latent Space
https://huggingface.co/papers/2602.22766
Daily AI Papers Briefing (2026-02-27)
【本日の論文】
1. HyTRec: A Hybrid Temporal-Aware Attention Architecture for Long Behavior Sequential Recommendation
https://huggingface.co/papers/2602.18283
2. MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models
https://huggingface.co/papers/2602.17602
3. DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
https://huggingface.co/papers/2602.12160
4. SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model
https://huggingface.co/papers/2602.21818
5. ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning
https://huggingface.co/papers/2602.21534
Daily AI Papers Briefing (2026-02-26)
【本日の論文】
1. On Data Engineering for Scaling LLM Terminal Capabilities
https://huggingface.co/papers/2602.21193
2. Query-focused and Memory-aware Reranker for Long Context Processing
https://huggingface.co/papers/2602.12192
3. PyVision-RL: Forging Open Agentic Vision Models via RL
https://huggingface.co/papers/2602.20739
4. From Perception to Action: An Interactive Benchmark for Vision Reasoning
https://huggingface.co/papers/2602.21015
5. Test-Time Training with KV Binding Is Secretly Linear Attention
https://huggingface.co/papers/2602.21204
Daily AI Papers Briefing (2026-02-25)
【本日の論文】
1. A Very Big Video Reasoning Suite
https://huggingface.co/papers/2602.20159
2. VLANeXt: Recipes for Building Strong VLA Models
https://huggingface.co/papers/2602.18532
3. SkillOrchestra: Learning to Route Agents via Skill Transfer
https://huggingface.co/papers/2602.19672
4. TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics
https://huggingface.co/papers/2602.19313
5. Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device
https://huggingface.co/papers/2602.20161
Daily AI Papers Briefing (2026-02-24)
【本日の論文】
1. VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
https://huggingface.co/papers/2602.10693
2. Does Your Reasoning Model Implicitly Know When to Stop Thinking?
https://huggingface.co/papers/2602.08354
3. Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control
https://huggingface.co/papers/2602.18422
4. Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers
https://huggingface.co/papers/2602.18292
5. Spanning the Visual Analogy Space with a Weight Basis of LoRAs
https://huggingface.co/papers/2602.15727
Daily AI Papers Briefing (2026-02-23)
【本日の論文】
1. SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning
https://huggingface.co/papers/2602.13515
2. Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents
https://huggingface.co/papers/2602.16855
3. Unified Latents (UL): How to train your latents
https://huggingface.co/papers/2602.17270
4. Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5
https://huggingface.co/papers/2602.14457
5. Arcee Trinity Large Technical Report
https://huggingface.co/papers/2602.17004
Daily AI Papers Briefing (2026-02-22)
【本日の論文】
1. SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning
https://huggingface.co/papers/2602.13515
2. Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents
https://huggingface.co/papers/2602.16855
3. Unified Latents (UL): How to train your latents
https://huggingface.co/papers/2602.17270
4. Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5
https://huggingface.co/papers/2602.14457
5. Arcee Trinity Large Technical Report
https://huggingface.co/papers/2602.17004
Daily AI Papers Briefing (2026-02-21)
【本日の論文】
1. SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning
https://huggingface.co/papers/2602.13515
2. Unified Latents (UL): How to train your latents
https://huggingface.co/papers/2602.17270
3. Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents
https://huggingface.co/papers/2602.16855
4. "What Are You Doing?": Effects of Intermediate Feedback from Agentic LLM In-Car Assistants During Multi-Step Processing
https://huggingface.co/papers/2602.15569
5. Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents
https://huggingface.co/papers/2602.16699
Daily AI Papers Briefing (2026-02-20)
【本日の論文】
1. SLA2: Sparse-Linear Attention with Learnable Routing and QAT
https://huggingface.co/papers/2602.12675
2. RynnBrain: Open Embodied Foundation Models
https://huggingface.co/papers/2602.14979
3. Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation
https://huggingface.co/papers/2602.16705
4. CADEvolve: Creating Realistic CAD via Program Evolution
https://huggingface.co/papers/2602.16317
5. Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality
https://huggingface.co/papers/2602.14080
Daily AI Papers Briefing (2026-02-19)
【本日の論文】
1. Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines?
https://huggingface.co/papers/2602.14111
2. SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
https://huggingface.co/papers/2602.12670
3. GLM-5: from Vibe Coding to Agentic Engineering
https://huggingface.co/papers/2602.15763
4. Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook
https://huggingface.co/papers/2602.14299
5. ResearchGym: Evaluating Language Model Agents on Real-World AI Research
https://huggingface.co/papers/2602.15112
Daily AI Papers Briefing (2026-02-18)
【本日の論文】
1. Experiential Reinforcement Learning
https://huggingface.co/papers/2602.13949
2. DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories
https://huggingface.co/papers/2602.10809
3. REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
https://huggingface.co/papers/2602.14234
4. STATe-of-Thoughts: Structured Action Templates for Tree-of-Thoughts
https://huggingface.co/papers/2602.14265
5. Query as Anchor: Scenario-Adaptive User Representation via Large Language Model
https://huggingface.co/papers/2602.14492
Daily AI Papers Briefing (2026-02-17)
【本日の論文】
1. Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs
https://huggingface.co/papers/2602.10388
2. SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise
https://huggingface.co/papers/2602.12783
3. MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs
https://huggingface.co/papers/2602.12705
4. Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception
https://huggingface.co/papers/2602.11858
5. OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence
https://huggingface.co/papers/2602.08683
Daily AI Papers Briefing (2026-02-05)
【本日の論文】
1. CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding
https://huggingface.co/papers/2602.01785
2. AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration
https://huggingface.co/papers/2602.03786
3. No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs
https://huggingface.co/papers/2602.02103
4. MARS: Modular Agent with Reflective Search for Automated AI Research
https://huggingface.co/papers/2602.02660
5. daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently
https://huggingface.co/papers/2602.02619
Daily AI Papers Briefing (2026-02-04)
【本日の論文】
1. Green-VLA: Staged Vision-Language-Action Model for Generalist Robots
https://huggingface.co/papers/2602.00919
2. Kimi K2.5: Visual Agentic Intelligence
https://huggingface.co/papers/2602.02276
3. Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
https://huggingface.co/papers/2601.22060
4. Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models
https://huggingface.co/papers/2602.02185
5. Closing the Loop: Universal Repository Representation with RPG-Encoder
https://huggingface.co/papers/2602.02084
Daily AI Papers Briefing (2026-02-02)
【本日の論文】
1. Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives
https://huggingface.co/papers/2601.20833
2. Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
https://huggingface.co/papers/2601.20354
3. Scaling Embeddings Outperforms Scaling Experts in Language Models
https://huggingface.co/papers/2601.21204
4. DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
https://huggingface.co/papers/2601.22153
5. MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods
https://huggingface.co/papers/2601.21821
Daily AI Papers Briefing (2026-02-01)
【本日の論文】
1. Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives
https://huggingface.co/papers/2601.20833
2. Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
https://huggingface.co/papers/2601.20354
3. Scaling Embeddings Outperforms Scaling Experts in Language Models
https://huggingface.co/papers/2601.21204
4. DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
https://huggingface.co/papers/2601.22153
5. MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods
https://huggingface.co/papers/2601.21821
Daily AI Papers Briefing (2026-01-31)
【本日の論文】
1. Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives
https://huggingface.co/papers/2601.20833
2. Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
https://huggingface.co/papers/2601.20354
3. Scaling Embeddings Outperforms Scaling Experts in Language Models
https://huggingface.co/papers/2601.21204
4. DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
https://huggingface.co/papers/2601.22153
5. OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
https://huggingface.co/papers/2601.21639
Daily AI Papers Briefing (2026-01-30)
【本日の論文】
1. Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation
https://huggingface.co/papers/2601.20614
2. Advancing Open-source World Models
https://huggingface.co/papers/2601.20540
3. Innovator-VL: A Multimodal Large Language Model for Scientific Discovery
https://huggingface.co/papers/2601.19325
4. DeepSeek-OCR 2: Visual Causal Flow
https://huggingface.co/papers/2601.20552
5. Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning
https://huggingface.co/papers/2601.20209
Daily AI Papers Briefing (2026-01-29)
【本日の論文】
1. AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security
https://huggingface.co/papers/2601.18491
2. AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning
https://huggingface.co/papers/2601.18631
3. A Pragmatic VLA Foundation Model
https://huggingface.co/papers/2601.18692
4. Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models
https://huggingface.co/papers/2601.19834
5. AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking
https://huggingface.co/papers/2601.17645