Ege Oezsoy
            
            @EgeOezsoy
Followers
                338
              Following
                503
              Media
                15
              Statuses
                84
              PhD Student at TUM/CAMP. Semantic Scene Graphs, 3D Computer Vision, Deep Learning, Surgery Understanding
              
              Joined November 2012
            
            
           Excited to present two works at #CVPR2025! If you're attending CVPR, feel free to stop by and say hi! 📍 Main Conference MM-OR: A Large Multimodal OR Dataset for Semantic Surgical Scene Understanding Arxiv:  https://t.co/85eZUlksoS  Poster Session: 4 Poster ID: 341 
          
            
            arxiv.org
              Operating rooms (ORs) are complex, high-stakes environments requiring precise understanding of interactions among medical staff, tools, and equipment for enhancing surgical assistance, situational...
            
                
                1
              
              
                
                0
              
              
                
                10
              
             This work was co-led with @arda_mamur , and realized together with our fantastic co-authors Felix Tristram, @ChPellegrini , @aiiakisi , @BusamBenjamin, and @NassirNavab . A huge thanks to them and to everyone who supported us along the way. 
          
                
                0
              
              
                
                0
              
              
                
                0
              
             We believe EgoExOR will catalyze innovation in OR perception, paving the way for smarter, safer surgical environments, with implications for any field requiring precise human-robot collaboration in complex environments. 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             Captured with up to five egocentric Meta ARIA glasses and five external RGB-D cameras, ultrasound monitor feed, audio, fused 3D point-cloud reconstructions, gaze, hand-pose signals, and detailed scene graph annotations, it provides a comprehensive view of surgical workflows. 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             EgoExOR is the first operating room dataset that fuses egocentric and exocentric perspectives, covering two emulated spine procedures for spinal pain relief and herniated disc removal: Ultrasound-Guided Needle Insertion and Minimally Invasive Spine Surgery. 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             🎉 Excited to share that our paper “EgoExOR: An Ego–Exo-Centric Operating Room Dataset for Surgical Activity Understanding” has been accepted to #NeurIPS2025. #SDS #Egocentric #SceneGraphs #OperatingRoom Paper:  https://t.co/YOKuuv3Edg  Code & Data:  https://t.co/wIj0ml7INe 
          
          
                
                1
              
              
                
                2
              
              
                
                6
              
             📍 Multimodal Learning & Applications Workshop LF-SGG: Location-Free Scene Graph Generation 🔗 
          
                
                0
              
              
                
                0
              
              
                
                2
              
             I am happy to share that RaDialog was accepted to MIDL 2025! We look forward to presenting the newest version of RaDialog, with an improved architecture, extended instruct dataset, more thorough evaluation and analysis and Hugging Face integration. 
          
                
                1
              
              
                
                3
              
              
                
                14
              
             🎉 Thrilled to share MM-OR has been accepted to #CVPR2025. MM-OR is a new, large scale multimodal operating room dataset for semantic understanding of surgeries. It captures robotic knee replacement surgeries with multi-view RGB-D video, audio, speech transcripts, robotic logs, 
          
                
                2
              
              
                
                4
              
              
                
                17
              
             We believe that our new task and its objective evaluation paves the way to more efficient scene interpretation from visual content in the form of scene graphs. 
          
                
                0
              
              
                
                0
              
              
                
                2
              
             We demonstrate the effectiveness of our method on three scene graph generation datasets as well as two downstream tasks, image retrieval and visual question answering, and show that our approach is competitive to existing methods while not relying on location cues. 
          
                
                1
              
              
                
                0
              
              
                
                2
              
             We design the first LF-SGG method, Pix2SG, using autoregressive sequence modeling and to objectively evaluate LF-SGG, we design and implement an efficient branching algorithm. 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             Recognizing that many downstream application do not require location data (image retrieval, visual question answering, image captioning, and more), we break this dependency and introduce location-free scene graph generation (LF-SGG). 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             Traditional scene graph generation methods rely on location labels in form of bounding boxes or segmentation masks, increasing annotation costs and limiting dataset expansion. 
          
                
                1
              
              
                
                0
              
              
                
                0
              
             Excited to introduce the task of Location-Free Scene Graph Generation (LF-SGG). To this end, we design the first LF-SGG method, Pix2SG and the corresponding evaluation method based on heuristic tree search. Paper:  https://t.co/S92na8xCUP  Code:  https://t.co/8I4g9ounjG 
          
          
                
                1
              
              
                
                0
              
              
                
                1
              
             Dear all, This is the last magazine by RSIP Vision:  https://t.co/ANqR0V5JoL  The company has sent 202 magazines since April 2016. Computer Vision News will continue its publication. Enjoy the reading and... don't miss the cat on page 43! Ralph 
          
                
                1
              
              
                
                6
              
              
                
                36
              
             Title: ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling Paper Link:  https://t.co/NCO7wXOClN  Authors: Ege Özsoy*, Chantal Pellegrini*, Matthias Keicher, Nassir Navab Code:   https://t.co/JvE505knDX 
          
          
                
                0
              
              
                
                0
              
              
                
                1
              
             We are extremely honered that our work“ORacle“ has received the Best Paper Runner-Up Award at MICCAI 2024! Huge thanks to our supervisors @MatthiasKeicher @NassirNavab
            #MICCAI2024 #BestPaperRunnerUp
          
          
                
                1
              
              
                
                3
              
              
                
                44
              
             Another very well-deserved oral from E. Ă–zsoy from @NassirNavab lab. Really enjoying the current #miccai2024 session.  https://t.co/wZMtxYissk 
          
          
                
                1
              
              
                
                3
              
              
                
                26
              
             Poster Number: W-PM-188 Poster Session 6, Computer Assisted Interventions and Surgery @MICCAI_Society @MiccaiStudents
          
          
                
                0
              
              
                
                0
              
              
                
                0