Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published 26 days ago • 52
FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation Paper • 2312.04484 • Published Dec 7, 2023
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes Paper • 2501.04004 • Published 4 days ago • 1
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper • 2501.04003 • Published 4 days ago • 14
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving Paper • 2501.04005 • Published 4 days ago
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Paper • 2501.00326 • Published 11 days ago • 1
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding Paper • 2501.00712 • Published 11 days ago • 5
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published 9 days ago • 45
DateLogicQA: Benchmarking Temporal Biases in Large Language Models Paper • 2412.13377 • Published 25 days ago • 2
view post Post 4593 Google drops Gemini 2.0 Flash Thinkinga new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and morenow available in anychat, try it out: akhaliq/anychat See translation 🚀 6 6 🔥 4 4 👀 1 1 + Reply
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Paper • 2406.02347 • Published Jun 4, 2024 • 2
FlexEvent: Event Camera Object Detection at Arbitrary Frequencies Paper • 2412.06708 • Published Dec 9, 2024
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 74
Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective Paper • 2208.07365 • Published Aug 15, 2022
4D Contrastive Superflows are Dense 3D Representation Learners Paper • 2407.06190 • Published Jul 8, 2024
Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier Paper • 2412.04261 • Published Dec 5, 2024 • 1
SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding Paper • 2412.04383 • Published Dec 5, 2024 • 4