E-Book, Englisch, Band 15126, 484 Seiten, eBook
18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXVIII
E-Book, Englisch, Band 15126, 484 Seiten, eBook
Reihe: Lecture Notes in Computer Science
ISBN: 978-3-031-73113-6
Verlag: Springer International Publishing
Format: PDF
Kopierschutz: 1 - PDF Watermark
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
Reinforcement Learning Friendly Vision-Language Model for Minecraft.- Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation.- Training-free Composite Scene Generation for Layout-to-Image Synthesis.- Robustness Preserving Fine-tuning using Neuron Importance.- ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation.- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation.- Similarity of Neural Architectures using Adversarial Attack Transferability.- Dual-Rain: Video Rain Removal using Assertive and Gentle Teachers.- PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation.- OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web.- AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering.- Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models.- Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks.- Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation.- MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos.- Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement.- Scene-Conditional 3D Object Stylization and Composition.- GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning.- Revisit Anything: Visual Place Recognition via Image Segment Retrieval.- EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching.- DGD: Dynamic 3D Gaussians Distillation.- Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation.- DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation.- Self-Guided Generation of Minority Samples Using Diffusion Models.- DEVIAS: Learning Disentangled Video Representations of Action and Scene.- AD3: Introducing a score for Anomaly Detection Dataset Difficulty assessment using VIADUCT dataset.- RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting.