E-Book, Englisch, Band 15080, 493 Seiten, eBook
18th European Conference, Milan, Italy, September 29 – October 4, 2024, Proceedings, Part XXII
E-Book, Englisch, Band 15080, 493 Seiten, eBook
Reihe: Lecture Notes in Computer Science
ISBN: 978-3-031-72670-5
Verlag: Springer International Publishing
Format: PDF
Kopierschutz: 1 - PDF Watermark
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting.- Robust-Wide: Robust Watermarking against Instruction-driven Image Editing.- OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal.- Formula-Supervised Visual-Geometric Pre-training.- VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding.- Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-Spoofing.- Restoring Images in Adverse Weather Conditions via Histogram Transformer.- PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer.- NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis.- Elysium: Exploring Object-level Perception in Videos through Semantic Integration Using MLLMs.- G2fR: Frequency Regularization in Grid-based Feature Encoding Neural Radiance Fields.- Getting it Right: Improving Spatial Consistency in Text-to-Image Models.- Generating 3D House Wireframes with Semantics.- GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image.- Shape-guided Configuration-aware Learning for Endoscopic-image-based Pose Estimation of Flexible Robotic Instruments.- Nonverbal Interaction Detection.- UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving.- Responsible Visual Editing.- Drag Anything: Motion Control for Anything using Entity Representation.- SegPoint: Segment Any Point Cloud via Large Language Model.- Navigation Instruction Generation with BEV Perception and Large Language Models.- Rebalancing Using Estimated Class Distribution for Imbalanced Semi-Supervised Learning under Class Distribution Mismatch.- Vista3D: unravel the 3d darkside of a single image.- The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation.- Detecting As Labeling: Rethinking LiDAR-camera Fusion in 3D Object Detection.- FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally.- Exploiting Dual-Correlation for Multi-frame Time-of-Flight Denoising.