Stars
Python tool for converting files and office documents to Markdown.
Portfolio builder designed specifically for software developers. It allows professionals to showcase their projects, skills, and experience in a sleek, "hacker-chic" interface that emphasizes code …
UniFace: A Unified Face Analysis Library for Python | A comprehensive suite of high-performance face analysis components that outperform leading alternatives.
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
A comprehensive receipt processing system built with PaddleOCR, featuring intelligent text extraction, receipt data parsing, batch processing, and spending analysis capabilities.
Digital Human Resource: 2D/3D/4D Human Modeling, Avatar Generation & Animation, Clothed People Digitalization, Virtual Try-On, etc.
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
Summary of publicly available resources such as code, datasets, and scientific papers for the FLAME 3D head model
A curated list of resources of audio-driven talking face generation
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
An open-source AI agent that brings the power of Gemini directly into your terminal.
We segmented brain tumors using the BraTS dataset. Because the data is in 3D format, we used a slicing method that cuts the volumes into 2D images along each of the 3 axes, and then giving the mod…
This repository contains code for fine-tuning Google's PaliGemma vision-language model on the Flickr8k dataset for image captioning tasks
A concise image-captioning pipeline that fine-tunes a ViT encoder with a BERT decoder on Flickr8K for training, plus a standalone script to load the trained model and generate captions on new images.
This code implements a Convolutional Neural Network (CNN) to classify plant diseases using the PlantVillage dataset. It includes the full pipeline for data preparation, model training, evaluation, …
PyTorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
PyTorch implementations of Generative Adversarial Networks.
Release for Improved Denoising Diffusion Probabilistic Models
In this project we used a k-nearest neighbors (KNN) algorithm to recommend a book based on your previous book preferences.
In this project we train an RNN model on a set of tweets and then predict the sentiment of each tweet.
In this project we are using LSTM to classify texts as spam or ham.
In this project we predict the medical cost for a patient based on their health conditions.
In this project we classify dog and cat images using a Convolutional Neural Network (CNN).
I am an Artificial Intelligence Engineer with expertise in Machine Learning and Computer Vision.
Linear regression using MATLAB on a Kaggle dataset.
[NeurIPS 2024] Using vision-language models to decode natural image perception from non-invasive brain recordings.


