手機站首頁散文詩歌雜文隨筆日記小小說

散文網(wǎng) » 生活 »日常 » 2023.04.04 ArXiv精選

2023.04.04 ArXiv精選

2023-04-06 09:31 作者:PaperABC 0人讀過 | 我要投稿

關(guān)注領(lǐng)域：

AIGC
3D computer vision learning
Fine-grained learning
GNN
其他

聲明

論文較多，時間有限，本專欄無法做文章的講解，只挑選出符合PaperABC研究興趣和當(dāng)前熱點問題相關(guān)的論文，如果你的research topic和上述內(nèi)容有關(guān)，那本專欄可作為你的論文更新源或Paper reading list．

Paper list:

今日ArXiv共更新151篇

AIGC

Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos

https://arxiv.org/pdf/2304.01186.pdf

騰訊的工作，視頻領(lǐng)域的controlNet.

Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP

https://arxiv.org/pdf/2304.00964.pdf

文本驅(qū)動的圖像編輯方法.

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

https://arxiv.org/pdf/2304.00916.pdf

太卷了，太卷了，自己看吧，朋友們。

VLP

Vision-Language Models for Vision Tasks: A Survey

https://arxiv.org/pdf/2304.00685.pdf

視覺語言預(yù)訓(xùn)練模型的最新綜述

RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

https://arxiv.org/pdf/2304.00962.pdf

區(qū)域級別的點云和文本的對比學(xué)習(xí)，主要用于open-world的3D場景理解。

AirLoc: Object-based Indoor Relocalization

https://arxiv.org/pdf/2304.00954.pdf

CMU的室內(nèi)場景定位工作.

Multi-Modal Representation Learning with Text-Driven Soft Masks

https://arxiv.org/pdf/2304.00719.pdf

標(biāo)簽：