E-Book, Englisch, Band 15107, 511 Seiten, eBook
18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XLIX
E-Book, Englisch, Band 15107, 511 Seiten, eBook
Reihe: Lecture Notes in Computer Science
ISBN: 978-3-031-72967-6
Verlag: Springer International Publishing
Format: PDF
Kopierschutz: 1 - PDF Watermark
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
Real-time Holistic Robot Pose Estimation with Unknown States.- CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning.- A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars.- An accurate detection is not all you need to combat label noise in web-noisy datasets.- Online Vectorized HD Map Construction using Geometry.- Image-adaptive 3D Lookup Tables for Real-time Image Enhancement with Bilateral Grids.- Learned HDR Image Compression for Perceptually Optimal Storage and Display.- Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion.- Non-Exemplar Domain Incremental Learning via Cross-Domain Concept Integration.- Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression.- Improving Virtual Try-On with Garment-focused Diffusion Models.- Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection.- Disentangled Generation and Aggregation for Robust Radiance Fields.- UNIKD: UNcertainty-Filtered Incremental Knowledge Distillation for Neural Implicit Representation.- Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation.- MoAI: Mixture of All Intelligence for Large Language and Vision Models.- Semantic-guided Robustness Tuning for Few-Shot Transfer Across Extreme Domain Shift.- Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations.- SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models.- Open-World Dynamic Prompt and Continual Visual Representation Learning.- Learning Video Context as Interleaved Multimodal Sequences.- Learning Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors.- Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding.- Deep Feature Surgery: Towards Accurate and Efficient Multi-Exit Networks.- Multi-scale Cross Distillation for Object Detection in Aerial Images.- Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation.- Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence.