Here is my personal website.
- 🔭 I’m currently working on Large Multi-modality Model and data-centric AI.
Here is my personal website.
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
[AAAI2021] The source code for our paper 《Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion》.
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》