Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Train a language model

Language models that you can fine-tune using Replicate's training API.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Updated 133 runs

Updated 7 runs

Updated 4.2K runs

Implementation of the RemBG library

Updated 129 runs

A tiny model for testing out Cog

Updated 58 runs

BLIP3 is a series of foundational Large Multimodal Models (LMMs) developed by Salesforce AI Research

Updated 112 runs

Blip 3 (blip3-phi3-mini-instruct-r-v1), Answers questions about images

Updated 51 runs

viⓍTTS vixTTS là mô hình tạo sinh giọng nói cho phép bạn sao chép giọng nói sang các ngôn ngữ khác nhau chỉ bằng cách sử dụng một đoạn âm thanh nhanh dài 6 giây

Updated 131 runs

Transcribe audios using OpenAI's Whisper with stabilizing timestamps by stable-ts python package.

Updated 26 runs

Updated 3 runs

Auto-magically relights your images

Updated 547 runs

Use a face to make images. Uses SDXL fine-tuned checkpoints.

Updated 670 runs

Updated 33 runs

Use a face to instantly make images. Uses SDXL Lightning checkpoints.

Updated 543 runs

Run any ComfyUI workflow. Guide: https://github.com/fofr/cog-comfyui

Updated 201.1K runs

Updated 57 runs

Cog to turn minimally-formatted plaintext into pdfs (using tex on the backend)

Updated 28 runs

Dark Sushi Mix 2.25D Model with vae-ft-mse-840000-ema (Text2Img, Img2Img and Inpainting)

Updated 39.4K runs

DeepSeek LLM, an advanced language model comprising 67 billion parameters. Trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese

Updated 21 runs

Updated 127 runs

turns text into pdf files with TeX

Updated 10 runs

Meta Llama Guard 2 is an 8B parameter Llama 3-based LLM safeguard model

Updated 60 runs

Hermes 2 Pro is an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house

Updated 66 runs

Updated 9 runs

Virtual try-on using Stable Diffusion and IP-Adapter

Updated 103 runs

Updated 100 runs

a fine-tuned model to detect dragon in images.

Updated 22 runs

InstantID. ControlNets. More base SDXL models. And the latest ByteDance's ⚡️SDXL-Lightning !⚡️

Updated 90.6K runs

The img2img pipeline that makes an anime-style image of a person. It uses one of sd1.5 models as a base, depth-estimation as a ControleNet and IPadapter model for face consistency.

Updated 72 runs

GPU accelerated replay renderer / video data clipper for comma.ai connect's openpilot route data. SEE README.

Updated 2.1K runs

Consistent Self-Attention for Long-Range Image and Video Generation

Updated 4.2K runs

Updated 569 runs

Robust face restoration algorithm for old photos / AI-generated faces (adapted to work with video inputs)

Updated 75 runs

Updated 34 runs

📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Updated 15K runs

Semantic Segmentation

Updated 119.3K runs

A SDXL Model trained from another SDXL-hiroshinagai model images

Updated 102 runs

Train SDXL 1.0 with LoRA | mixed precision bf16 and save precision fp16

Updated 230 runs

Just some good ole beautifulsoup scrapping URL magic. (some sites don't work as they block scrapping, but still useful)

Updated 2.1K runs

High resolution image Upscaler and Enhancer. Use at ClarityAI.cc. A free Magnific alternative. Twitter/X: @philz1337x

Updated 1.3M runs

Realistic Inpainting with ControlNET (M-LSD + SEG)

Updated 9.1K runs

Tango2: LLM-guided Diffusion-based Text-to-Audio Generation and DPO-based Alignment

Updated 18.5K runs

🗣️ TalkNet-ASD: Detect who is speaking in a video

Updated 57 runs

Transfer a material from an image to a subject

Updated 3.3K runs

Updated 30 runs

Demucs is an audio source separator created by Facebook Research.

Updated 340.7K runs

Updated 79 runs

Creates voxels like game asset

Updated 225 runs

Projection module trained to add vision capabilties to Llama 3 using SigLIP

Updated 4.2K runs

Updated 72 runs