Explore

zsxkib/pulid

📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment

15K runs

iordcalin/material-transfer

Transfer a material from an image to a subject

3.3K runs

cjwbw/openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

7K runs

snowflake/snowflake-arctic-instruct

An efficient, intelligent, and truly open-source language model

247.1K runs

meta/meta-llama-3-70b-instruct

A 70 billion parameter language model from Meta, fine tuned for chat completions

10.7M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

349.1K runs

I want to…

Generate images

Models that generate images from text prompts

Edit images

Tools for manipulating images.

Caption images

Models that generate text from images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Get embeddings

Models that generate embeddings from inputs

Use a language model

Models that can understand and generate text

Extract text from images

Optical character recognition (OCR) and text extraction

Train a language model

Language models that you can fine-tune using Replicate's training API.

Use a face to make images

Make realistic images of people instantly

Chat with images

Ask language models about images

Transcribe speech

Models that convert speech to text

Use handy tools

Toolbelt-type models for videos and images.

Generate music

Models to generate and modify music

Generate videos

Models that create and edit videos

Generate speech

Convert text to speech

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Get structured data

Language models that support grammar-based decoding as well as jsonschema constraints.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 2 weeks ago 36.1M runs

yorickvp/llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 1 month, 2 weeks ago 7.7M runs

openai/whisper

Convert speech in audio to text

Updated 6 months ago 7.1M runs

salesforce/blip

Bootstrapping Language-Image Pre-training

Updated 1 year, 7 months ago 80.6M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 6 months ago 50.4M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 1 year, 7 months ago 5.8M runs

nightmareai/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale

Updated 3 weeks ago 44M runs

lucataco/qwen-vl-chat

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Updated 6 months, 3 weeks ago 568.8K runs

Latest models

expa-ai/dove-hairstyle-campaign

Updated 4 minutes ago 133 runs

hadilq/hair-segment

Updated 5 hours ago 7 runs

decorx-ai/augment-experiments

Updated 13 hours ago 4.2K runs

re-mix-1/rembg

Implementation of the RemBG library

Updated 1 day ago 129 runs

zeke/hello-world

A tiny model for testing out Cog

Updated 1 day, 3 hours ago 58 runs

lucataco/blip3-phi3-mini-instruct-r-v1

BLIP3 is a series of foundational Large Multimodal Models (LMMs) developed by Salesforce AI Research

Updated 1 day, 3 hours ago 112 runs

zsxkib/blip-3

Blip 3 (blip3-phi3-mini-instruct-r-v1), Answers questions about images

Updated 1 day, 3 hours ago 51 runs

suminhthanh/vixtts

viⓍTTS vixTTS là mô hình tạo sinh giọng nói cho phép bạn sao chép giọng nói sang các ngôn ngữ khác nhau chỉ bằng cách sử dụng một đoạn âm thanh nhanh dài 6 giây

Updated 1 day, 9 hours ago 131 runs

hovevideo/stable-whisper

Transcribe audios using OpenAI's Whisper with stabilizing timestamps by stable-ts python package.

Updated 1 day, 22 hours ago 26 runs

sinazar/campfire-deer3-may9

Updated 1 day, 23 hours ago 3 runs

zsxkib/ic-light

Auto-magically relights your images

Updated 2 days, 6 hours ago 547 runs

fofr/pulid-base

Use a face to make images. Uses SDXL fine-tuned checkpoints.

Updated 2 days, 6 hours ago 670 runs

sourav-sarkar-doc32/smile-correct

Updated 2 days, 7 hours ago 33 runs

fofr/pulid-lightning

Use a face to instantly make images. Uses SDXL Lightning checkpoints.

Updated 2 days, 10 hours ago 543 runs

fofr/any-comfyui-workflow

Run any ComfyUI workflow. Guide: https://github.com/fofr/cog-comfyui

Updated 2 days, 12 hours ago 201.1K runs

fszotyi/sdxl-car

Updated 2 days, 14 hours ago 57 runs

georgedavila/cog-easytex

Cog to turn minimally-formatted plaintext into pdfs (using tex on the backend)

Updated 2 days, 23 hours ago 28 runs

asiryan/dark-sushi-mix-225d

Dark Sushi Mix 2.25D Model with vae-ft-mse-840000-ema (Text2Img, Img2Img and Inpainting)

Updated 3 days, 23 hours ago 39.4K runs

lucataco/deepseek-67b-base

DeepSeek LLM, an advanced language model comprising 67 billion parameters. Trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese

Updated 4 days ago 21 runs

remodela-ai/style-materials-transfer

Updated 4 days, 1 hour ago 127 runs

georgedavila/cog-tex2pdf

turns text into pdf files with TeX

Updated 4 days, 3 hours ago 10 runs

meta/meta-llama-guard-2-8b

Meta Llama Guard 2 is an 8B parameter Llama 3-based LLM safeguard model

Updated 4 days, 3 hours ago 60 runs

lucataco/hermes-2-pro-llama-3-8b

Hermes 2 Pro is an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house

Updated 5 days ago 66 runs

muqtadar08/llm_finetuning_dataset_generator

Updated 5 days, 15 hours ago 9 runs

wolverinn/ecommerce-virtual-try-on

Virtual try-on using Stable Diffusion and IP-Adapter

Updated 5 days, 16 hours ago 103 runs

chamuditha4/cartoonizer

Updated 6 days, 2 hours ago 100 runs

hadilq/dragon-notdragon

a fine-tuned model to detect dragon in images.

Updated 6 days, 3 hours ago 22 runs

tgohblio/instant-id-multicontrolnet

InstantID. ControlNets. More base SDXL models. And the latest ByteDance's ⚡️SDXL-Lightning !⚡️

Updated 6 days, 10 hours ago 90.6K runs

kitaef/mytestmodel

The img2img pipeline that makes an anime-style image of a person. It uses one of sd1.5 models as a base, depth-estimation as a ControleNet and IPadapter model for face consistency.

Updated 6 days, 11 hours ago 72 runs