Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Building on HF
4
Krishna Teja Chitty-Venkata
krishnateja95
Follow
memani's profile picture
sraskar's profile picture
21world's profile picture
4 followers
·
9 following
https://krishnateja95.github.io/
krishnateja95
kt95
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.5bits
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.5bits
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5bits
View all activity
Organizations
krishnateja95
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.5bits
25B
•
Updated
4 days ago
•
12
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.5bits
25B
•
Updated
4 days ago
•
12
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5bits
20B
•
Updated
4 days ago
•
13
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5bits
20B
•
Updated
4 days ago
•
13
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.5bits
22B
•
Updated
4 days ago
•
11
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.5bits
22B
•
Updated
4 days ago
•
11
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-7bits
27B
•
Updated
4 days ago
•
15
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-7bits
27B
•
Updated
4 days ago
•
15
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.75bits
26B
•
Updated
4 days ago
•
15
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.75bits
26B
•
Updated
4 days ago
•
15
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.25bits
24B
•
Updated
4 days ago
•
11
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.25bits
24B
•
Updated
4 days ago
•
11
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.75bits
22B
•
Updated
4 days ago
•
12
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.75bits
22B
•
Updated
4 days ago
•
12
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.25bits
21B
•
Updated
4 days ago
•
11
published
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.25bits
21B
•
Updated
4 days ago
•
11
updated
a model
4 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6bits
23B
•
Updated
4 days ago
•
64
published
a model
6 days ago
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6bits
23B
•
Updated
4 days ago
•
64
updated
2 models
about 1 month ago
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_6.5-bits
7B
•
Updated
Jan 26
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_6.25-bits
6B
•
Updated
Jan 26
Load more