E-Book, Englisch, Band 15121, 488 Seiten, eBook
18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXIII
E-Book, Englisch, Band 15121, 488 Seiten, eBook
Reihe: Lecture Notes in Computer Science
ISBN: 978-3-031-73036-8
Verlag: Springer International Publishing
Format: PDF
Kopierschutz: 1 - PDF Watermark
The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29–October 4, 2024.
The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
Large-scale Reinforcement Learning for Diffusion Models.- CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion.- FedHARM: Harmonizing Model Architectural Diversity in Federated Learning.- EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS.- Global Counterfactual Directions.- TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving.- RT-Pose: A 4D Radar-Tensor based 3D Human Pose Estimation and Localization Benchmark.- EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models.- RICA^2: Rubric-Informed, Calibrated Assessment of Actions.- Region-centric Image-Language Pretraining for Open-Vocabulary Detection.- Commonly Interesting Images.- Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities.- CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching.- Caltech Aerial RGB-Thermal Dataset in the Wild.- Diffusion Soup: Model Merging for Text-to-Image Diffusion Models.- Volumetric Rendering with Baked Quadrature Fields.- CityGuessr: City-Level Video Geo-Localization on a Global Scale.- Pseudo-Labelling Should Be Aware of Disguising Channel Activations.- Bayesian Detector Combination for Object Detection with Crowdsourced Annotations.- Revising Densification in Gaussian Splatting.- FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing.- Smoothness, Synthesis, and Sampling: Re-thinking Unsupervised Multi-View Stereo with DIV Loss.- Text Motion Translator: A Bi-Directional Model for Enhanced 3D Human Motion Generation from Open-Vocabulary Descriptions.- UL-VIO: Ultra-lightweight Visual-Inertial Odometry with Noise Robust Test-time Adaptation.- PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis.- R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding.- A Graph-Based Approach for Category-Agnostic Pose Estimation.