thisisvaze (Aaditya Vaze)

liked a model 4 months ago

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 362k • 349

liked a model 6 months ago

neuphonic/neutts-air

Text-to-Speech • 0.7B • Updated Feb 12 • 10.3k • 867

liked 3 models 7 months ago

liked a Space 11 months ago

Chatterbox TTS

🍿

1.73k

Expressive Zeroshot TTS

liked a Space 12 months ago

Dia 1.6B

👯

1.77k

Generate realistic dialogue from a script, using Dia!

liked a model about 1 year ago

sesame/csm-1b

Text-to-Speech • Updated Dec 1, 2025 • 152k • 2.36k

liked a Space about 1 year ago

Spark TTS

🌖

229

A text-to-speech model powered by SparkAudio and Mobvoi.

liked 2 models about 1 year ago

HuggingFaceTB/SmolVLM2-500M-Video-Instruct

Image-Text-to-Text • Updated Apr 8, 2025 • 353k • 132

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 333k • 1.58k

liked a Space about 1 year ago

Kokoro Text-to-Speech (WebGPU)

🗣

358

High-quality speech synthesis powered by Kokoro TTS

liked a model about 1 year ago

mlx-community/SmolVLM2-500M-Video-Instruct-mlx

Video-Text-to-Text • Updated Feb 20, 2025 • 2.29k • 18

liked a Space about 2 years ago

InstantID

😻

3.59k

Generate a custom image that keeps your face identity

liked a model over 2 years ago

meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 317 • 4.47k

liked a Space almost 3 years ago

InstructBLIP

📊

48

Instruction-tuned model for a range of vision-language tasks

liked a Space about 3 years ago

Prismer

🔺

137

liked a Space over 3 years ago

Stable Diffusion Multiplayer

👥

351

liked a model over 3 years ago

facebook/bart-base

Feature Extraction • 0.1B • Updated Nov 16, 2022 • 545k • 204

liked a Space over 3 years ago

CLIPnCROP

🌍

33

Extract and crop image sections based on text description

moondream/moondream3-preview

stabilityai/stable-audio-open-1.0

Marvis-AI/marvis-tts-250m-v0.1
