Change the repository type filter
All
Repositories list
74 repositories
- A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
- Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
MovieBench
Public- Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
- [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
sparseformer
Public(ICLR 2024, CVPR 2024) SparseFormerLOVA3
Public(NeurIPS 2024) Learning to Visual Question Answering, Asking and AssessmentVisInContext
PublicExo2Ego-V
Publicwatermark-steganalysis
Publicvideogui
PublicEvolveDirector
PublicMovieSeq
PublicGUI-Narrator
PublicRingID
Public- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
videollm-online
PublicX-Adapter
Publicafformer
PublicDragAnything
PublicAssistGaze
Public