Alexey Bokhovkin Profile
Alexey Bokhovkin

@ABokhovkin

Followers
356
Following
68
Media
3
Statuses
44

Computer Vision researcher @ TUM 3D Indoor Understanding

Joined December 2020
@ABokhovkin
Alexey Bokhovkin
5 months
📢SceneFactor code is released! SceneFactor is a factored latent diffusion for controllable, large-scale scene synthesis and editing! w/ @QTDSMQ, @shubhtuls, @angelaqdai Check out the code here: https://t.co/tJFRAPXKEI. We present SceneFactor at #CVPR2025 on Fri 13, -10:30
0
7
23
@MattNiessner
Matthias Niessner
7 months
📢Animating the Uncaptured 📢 We animate 3D humanoid meshes using video diffusion priors given a text prompt. 🎥 https://t.co/EpFW86gaRw https://t.co/suMQs8oQCL Realistic motion generation for 3D characters - without motion capture! 🚀 Great work by @marcbenedi @angelaqdai
3
40
124
@angelaqdai
Angela Dai
7 months
📢ExCap3D: Multilevel Captioning of Objects in 3D Scenes @chandan__yes generates consistent object and part-level descriptions of objects in 3D scenes, and introduces a new dataset with 190k captions for 34k ScanNet++ objects. Project: https://t.co/6tWzlYsx5F w/ @david_roz_
0
30
110
@angelaqdai
Angela Dai
9 months
📢 ScanNet++ v2 Benchmark Release! 🏆 Test your state-of-the-art models on: 🔹 Novel View Synthesis 📸➡️🖼️ 🔹 3D Semantic & Instance Segmentation 🤖🕶️ Shoutout to @chandan__yes and @liuyuehcheng for their incredible work 🚀 Check it out: https://t.co/SKCGM23hA0
2
42
203
@angelaqdai
Angela Dai
10 months
📢MeshArt: Generating Articulated Meshes with Structure-guided Transformers @DaoyiGao generates articulated meshes with a hierarchical transformer, modeling articulation-aware structures that guide mesh synthesis. w/ @yawarnihal @craigleili Project: https://t.co/aZPVyn8kQd
2
65
290
@angelaqdai
Angela Dai
10 months
Excited to announce ScanNet++ v2!🎉 @chandan__yes and @liuyuehcheng have been working tirelessly to bring: 🔹1006 high-fidelity 3D scans 🔹+ DSLR & iPhone captures 🔹+ rich semantics Elevating 3D scene understanding to the next level!🚀 w/ @MattNiessner https://t.co/QayR1S8KZZ
6
113
642
@MattNiessner
Matthias Niessner
10 months
📢📢GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion📢📢 We reconstruct animatable Gaussian head avatars from monocular videos captured by commodity devices such as
2
31
125
@angelaqdai
Angela Dai
11 months
📢DNF: Generating 4D animations with dictionary-based neural fields! @xinyi092298 presents a new dictionary-based neural field for unconditional 4D generation of deforming shapes -- generating motions with high-quality shape and temporal consistency. https://t.co/yAZi2k0PjB
0
44
147
@ABokhovkin
Alexey Bokhovkin
11 months
I'm so excited to introduce SceneFactor!
@angelaqdai
Angela Dai
11 months
📢SceneFactor: Generating & editing 3D indoor scenes from text! @ABokhovkin presents a factored latent diffusion for controllable, large-scale scene synthesis -- decomposed into high-level semantic generation + geometric refinement w/ @QTDSMQ, @shubhtuls https://t.co/WGTw70cKIo
0
1
20
@MattNiessner
Matthias Niessner
11 months
📢📢 GaussianSpeech: Audio-Driven Gaussian Avatars 📢📢 We synthesize photorealistic and 3D-consistent talking human head avatars driven directly from spoken audio. More specifically, we introduce an efficient 3DGS-based representation, combined with an
2
36
149
@angelaqdai
Angela Dai
1 year
How can we generate high-fidelity, complex 3D scenes? @QTDSMQ's LT3SD decomposes 3D scenes into latent tree representations, with diffusion on the latent trees enabling seamless infinite 3D scene synthesis! w/ @craigleili, @MattNiessner https://t.co/wv9bIhkkYi
3
77
318
@angelaqdai
Angela Dai
1 year
Excited to present DiffCAD coming to #SIGGRAPH2024! @DaoyiGao introduces the first probabilistic single-view CAD retrieval & alignment. We train only on synthetic -> generalize robustly to real images! Check out the code: https://t.co/hBCoN0Hx3w w/@david_roz_, @StefanLeuteneg1
0
31
116
@angelaqdai
Angela Dai
1 year
Excited to present GenZI at #CVPR2024! @craigleili introduces GenZI, the first zero-shot approach to creating realistic 3D human-scene interactions by leveraging interaction priors from large VLMs. Code and data on our website! https://t.co/hUhMgUoU70 https://t.co/rnn1G5HOuu
1
25
102
@artonson
Alexey Artemov
2 years
AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans A method for unsupervised instance segmentation of 3D outdoor LiDAR scenes. Project: https://t.co/m8DJanWH2T Vid: https://t.co/Z9OyZbskdJ Paper : https://t.co/rrmvQdjmWV
2
6
21
@angelaqdai
Angela Dai
2 years
Check out our #CVPR'24 papers on 3D human interactions, generative 3D modeling, and uncertainty-aware and unsupervised 3D semantic scene understanding! Congrats to @craigleili @david_roz_ @chrdiller @yawarnihal @shivangi2201 @jiapeng_tang @AnhQuanCAO for their amazing work!
3
29
118
@angelaqdai
Angela Dai
2 years
Check out @chrdiller's CG-HOI :) We generate realistic 3D human-object interactions, from object geometry and text description. A key ingredient is explicit modeling of contact, during training and as guidance during inference. https://t.co/Cl5Jw9oFBO https://t.co/FVIFqEpjHi
5
53
202
@MattNiessner
Matthias Niessner
2 years
Diffusion models are awesome! Check out our survey on Diffusion Models for Visual Computing! We give an introduction to diffusion models and highlight how they are used by state-of-the-art methods in graphics and vision. https://t.co/FqaqF7tMPM
4
87
378
@angelaqdai
Angela Dai
2 years
We've released the ScanNet++ data! Check it out: https://t.co/SKCGM23hA0 280 high-fidelity 3D scenes w/ 1mm geometry, DSLR+iPhone images, semantics We're currently beta-testing, please bear with us - approval may initially take up to 2 weeks Test scenes and benchmark to come!
0
41
164
@MattNiessner
Matthias Niessner
2 years
Can we match visual features jointly across multiple frames? Yes! @barbara_roessle's #ICCV2023 paper proposes a differentiable pose optimization for end2end feature matching across multiple frames, thus obtaining better poses! https://t.co/CCYtA5PxCS https://t.co/cM0gaG3ids
1
92
385