AI & ML interests

Efficient machine learning for any model and hardware: pruning, quantization, compilation, and more.

Recent Activity

Articles

sdiazlor 
published an article about 1 month ago
view article
Article

Pruna 0.3.2: More OSS Algos, More Ways to Optimize

4
sdiazlor 
published an article about 2 months ago
view article
Article

LLM Architectures Explained: What Powers Today’s Top Models

8
sdiazlor 
published an article 3 months ago
view article
Article

Slashing torch.compile Warmup & LoRA Swapping Times with Pruna

5
davidberenstein1957 
published an article 3 months ago
view article
Article

SmolLM-Smashed: Tiny Giants, Optimized for Speed

15
sdiazlor 
published an article 5 months ago
view article
Article

AI Model Optimization More Flexible Than Ever

13
sdiazlor 
published an article 5 months ago
view article
Article

Effective Prompting for Generative Vision Models

9
davidberenstein1957 
published an article 11 months ago
view article
Article

Measuring What Matters: Objective Metrics for Image Generation Assessment

10
davidberenstein1957 
published an article 11 months ago
view article
Article

Faster ComfyUI Nodes for Flux and Stable Diffusion with Pruna

8
davidberenstein1957 
published an article 12 months ago
view article
Article

🔥 Announcing FLUX-Juiced: The Fastest Image Generation Endpoint (2.6 times faster)!

12
davidberenstein1957 
published an article about 1 year ago
view article
Article

An Introduction to AI Model Optimization Techniques

28
davidberenstein1957 
published an article about 1 year ago
view article
Article

Optimise AI Models and Make Them Faster, Smaller, Cheaper, Greener

18