E-Book, Englisch, Band 15085, 485 Seiten, eBook
18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XXVII
E-Book, Englisch, Band 15085, 485 Seiten, eBook
Reihe: Lecture Notes in Computer Science
ISBN: 978-3-031-73383-3
Verlag: Springer International Publishing
Format: PDF
Kopierschutz: 1 - PDF Watermark
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking.- Tensorial template matching for fast cross-correlation with rotations and its application for tomography.- FreeAugment: Data Augmentation Search Across All Degrees of Freedom.- Learning Representations of Satellite Images From Metadata Supervision.- I2-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM.- FlashTex: Fast Relightable Mesh Texturing with LightControlNet.- GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence.- ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling.- PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance.- SOS: Segment Object System for Open-World Instance Segmentation With Object Priors.- Lagrangian Hashing for Compressed Neural Field Representations.- EDformer: Transformer-Based Event Denoising Across Varied Noise Levels.- Foster Adaptivity and Balance in Learning with Noisy Labels.- MetaAug: Meta-Data Augmentation for Post-Training Quantization.- Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis.- Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach.- Unleashing the Power of Prompt-driven Nucleus Instance Segmentation.- Gaze Target Detection Based on Head-Local-Global Coordination.- 3DSA:Multi-View 3D Human Pose Estimation With 3D Space Attention Mechanisms.- Toward Tiny and High-quality Facial Makeup with Data Amplify Learning.- An Economic Framework for 6-DoF Grasp Detection.- GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction.- Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning.- AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer.- Multi-Label Cluster Discrimination for Visual Representation Learning.- Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation.- DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion.