2023.03.09 ArXiv精選

2023-03-09 19:41 作者:PaperABC 0人讀過 | 我要投稿

論文較多，時間有限，本專欄無法做文章的講解，只挑選出符合PaperABC研究興趣和當前熱點問題相關的論文，如果你的research topic和上述內(nèi)容有關，那本專欄可作為你的論文更新源或Paper reading list．

Paper list:

今日ArXiv共更新81篇.

X-Avatar: Expressive Human Avatars

https://arxiv.org/pdf/2303.04805.pdf

ETＨ和微軟的合作工作．本文的方法能夠以整體的形式對人體，手部，面部表情和外貌進行建模．并且可以從簡單的RGB-D或者3D掃描數(shù)據(jù)中就能學到．

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models

https://arxiv.org/pdf/2303.04803.pdf

英偉達的工作．本文同時利用了Text2Image Diffusion模型的強大的open-vocabulary能力和CLIP強大的鑒別能力，完成open-vocabulary的Panoptic Segmentation.

Video-P2P: Video Editing with Cross-attention Control

https://arxiv.org/pdf/2303.04761.pdf

港中文的一篇工作．提出了Video-P2P方法，利用圖像模態(tài)的擴散模型實現(xiàn)了視頻端的編輯任務．近期會分享．

CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP

https://arxiv.org/pdf/2303.04748.pdf

清華團隊的工作，利用CLIP的預訓練知識來增強open-world下3D場景的理解能力．

Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

https://arxiv.org/pdf/2303.04671.pdf

微軟亞洲研究院的一篇工作，將Foundation model和ChatGPT結合，打造了更加靈活，功能豐富的Visual ChatGPT.

微軟亞洲研究院的一篇工作，將Foundation model和ChatGPT結合，打造了更加靈活，功能豐富的Visual ChatGPT.

標簽：