[Paper Review] CLIP: Vision-Language Multimodal Dataset - CLIP: Learning Transferable Visual Models From Natural Language Supervision
💡 This document summarizes the paper 'CLIP: Learning Transferable Visual Models From Natural Language Supervision'. The paper linked below (LERF) demonstrates that NeRF can be extended to a multimodal model by injecting the language embeddings of a multimodal model such as CLIP into NeRF, so it is worth referring to as well.
- Project: https://www.lerf.io/
- Paper: https://arxiv.org/abs/2303.09553
- Github: https://github.com/kerrj/lerf
- Dataset: https://drive.google.com/drive/folders/1vh0mSl7v29yaGsxleadcj-LCZO..