
Fast and memory-efficient classical machine learning operators

Python 540 43 Updated Jun 23, 2026

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,385 72 Updated Mar 5, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,703 570 Updated Nov 10, 2025

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,617 931 Updated Aug 21, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,641 796 Updated May 31, 2024

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Python 320 24 Updated May 17, 2024

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,963 129 Updated Dec 4, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,994 406 Updated Feb 27, 2025

Fast Diffusion Models with Transformers

Python 949 123 Updated Aug 17, 2025

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,964 512 Updated Dec 13, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,559 775 Updated Nov 24, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,954 882 Updated Jul 18, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,095 1,004 Updated Dec 2, 2025

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,439 88 Updated Sep 7, 2023

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,507 296 Updated May 31, 2024

[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 1,029 58 Updated May 15, 2026

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 3,323 496 Updated Apr 29, 2026

An open-source framework for training large multimodal models.

Python 4,108 320 Updated Aug 31, 2024

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook 4,441 729 Updated Jun 22, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 665 37 Updated Oct 22, 2024

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,636 647 Updated May 15, 2024

A simple, performant and scalable Jax LLM!

Python 2,335 542 Updated Jun 25, 2026

High-speed Large Language Model Serving for Local Deployment

C++ 9,593 584 Updated May 11, 2026

[T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Python 374 8 Updated Sep 30, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,939 875 Updated Jun 10, 2024

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 818 96 Updated Jun 26, 2024

Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection

Python 349 46 Updated Jul 6, 2023

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with online demo and local deployment; code and models fully open-sourced; supports Windows, macOS, Linux)

Python 4,952 477 Updated Jul 17, 2023

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Python 575 103 Updated Nov 20, 2023

[ICLR 2023] DiffMimic: Efficient Motion Mimicking with Differentiable Physics https://arxiv.org/abs/2304.03274

Python 307 21 Updated Jan 22, 2025