[논문리뷰] LiDAR2Map: LiDAR-based distillation scheme - LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation (CVPR 2023)

💡 본 문서는 'LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation' 논문을 정리해놓은 글입니다.
해당 논문은 CLIP 같은 멀티모달 모델의 language embedding을 NeRF 안에 집어넣어 NeRF를 Multi Modal로 확장 가능성을 보여준 논문이니 참고하시기 바랍니다.
- Paper: https://openaccess.thecvf.com/content/CVPR2023/papers/Wang_LiDAR2Map_In_Defense_of_LiDAR-Based_Semantic_Map_Construction_Using_Online_CVPR_2023_paper.pdf
- Github: https://github.com/songw-zju/LiDAR2Map
- Youtube: https://www.youtube.com/watch?v=nr25xFZbx8U

Contribution

BEV Feature Pyramid Decoder (BEV-FPD)
LiDAR-based network: an online Camera-to-LiDAR distillation scheme.
- mainly use LiDAR data and only extract image features as auxiliary network during training.
- Feature Distill + Logit Distill

LiDAR2Map Framework

1. BEV Feature Pyramid Decoder (BEV-FPD)

2. Position-Guided Feature Fusion Module (PGF2M)

we take advantage of the multi-scale BEV features {F˜BEV i } N i=1 from BEV-FPD for the feature-level distillation.

+ feature fusion module

knowledge distilation
카메라 이미지에서 얻은 풍부한 의미 정보를 활용하여 LiDAR 모델의 성능을 향상시키는 데 사용
실제 test에는 Lidar 시퀀스만 실행되니 속도적으로도 이득

저작자표시 비영리 변경금지 (새창열림)

'Study: Artificial Intelligence(AI) > AI: 3D Vision' 카테고리의 다른 글

[논문 리뷰] PlenOctrees for NeRF (ICCV 2021) : 랜더링 속도 개선 논문 (0)	2024.07.20
[논문리뷰] Lift, Splat, Shoot: LSS, Frustum, PointPillar - Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020) (0)	2024.07.17
[논문리뷰] UniTR: modality-agnostic transformer encoder - A Unified and Efficient Multi-Modal Transformer for BEV Representation (2023 ICCV) (0)	2024.07.16
[논문리뷰] DSVT: Voxel Transformer - Dynamic Sparse Voxel Transformer with Rotated Sets (CVPR 2023) (0)	2024.07.15
[논문리뷰] PETRv2: PETR + Temporal + Multi-Task - A Unified Framework for 3D Perception from Multi-Camera Images (2023 ICCV) (0)	2024.07.13

일	월	화	수	목	금	토
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Contribution

LiDAR2Map Framework

1. BEV Feature Pyramid Decoder (BEV-FPD)

2. Position-Guided Feature Fusion Module (PGF2M)

+ feature fusion module

'Study: Artificial Intelligence(AI) > AI: 3D Vision' 카테고리의 다른 글

티스토리툴바