Roleplaying, lorabration, abliteration, smol models, extensive filtering, unusual datasets, home usage, HPCs for AI, distributed training/federated learning, and sentience.
AI should find and label AI hallucinations with GANs, so we can put them in context and actually make use of them.
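As a toy illustration of that idea (not a working detector), here is a minimal GAN-style sketch: a discriminator learns to separate "grounded" from "hallucinated" statement embeddings while a generator tries to fool it. The embeddings below are random stand-ins; a real system would encode (claim, retrieved context) pairs with a sentence encoder.

```python
# Toy GAN sketch: discriminator labels embeddings as grounded (1) vs hallucinated (0).
# All data here is synthetic stand-in noise, purely for illustration.
import torch
import torch.nn as nn

DIM = 64

class Generator(nn.Module):
    """Maps noise to fake 'hallucinated statement' embeddings."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(DIM, 128), nn.ReLU(), nn.Linear(128, DIM))
    def forward(self, z):
        return self.net(z)

class Discriminator(nn.Module):
    """Scores an embedding: grounded (1) or hallucinated (0)."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(DIM, 128), nn.ReLU(), nn.Linear(128, 1))
    def forward(self, x):
        return self.net(x)

G, D = Generator(), Discriminator()
opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

for step in range(200):
    real = torch.randn(32, DIM) + 2.0    # stand-in for grounded-statement embeddings
    fake = G(torch.randn(32, DIM))       # generator's "hallucinations"

    # Train discriminator: real -> 1, fake -> 0.
    d_loss = bce(D(real), torch.ones(32, 1)) + bce(D(fake.detach()), torch.zeros(32, 1))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Train generator: try to get fakes labeled as grounded.
    g_loss = bce(D(fake), torch.ones(32, 1))
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
```

The by-product you'd care about is the discriminator: after training, it is a scorer that could in principle flag suspect generations for labeling and review.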
I just pushed Claude Code Agent Swarm with 20 coding agents on my desktop GPU workstation.
With local AI, I don't have Claude Code's /fast switch, but I have /absurdlyfast:
- Prompt read: 100,499 tokens/second (yeah, 100k, not a typo)
- Generation: 811 tokens/second
- KV cache: 707,200 tokens
- Hardware: 4x A6000 (gen 1), GPUs that are 5+ years old
It's not the car. It's the driver.
Qwen3 Coder Next AWQ with the KV cache at BF16. It scores 82.1% in C# on a codebase that's been in development for 29 years, vs. Opus 4.5 at only 57.5%. When your codebase predates Stack Overflow, you don't need the biggest model; you need the one that actually remembers Windows 95.
My current bottleneck is my 27" monitor. Can't fit all 20 Theos on screen without squinting.
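For anyone curious what a setup like this can look like, here is a minimal sketch of serving an AWQ-quantized coder model across four GPUs with vLLM. The model path, context length, and flag values are placeholders for illustration, not my actual configuration.

```python
# Hypothetical local serving sketch: an AWQ-quantized coder model sharded
# across 4 GPUs, with the KV cache kept at the model dtype (BF16).
# Model path and numbers are placeholders, not the real config.
from vllm import LLM, SamplingParams

llm = LLM(
    model="path/to/qwen3-coder-awq",  # placeholder for an AWQ-quantized coder checkpoint
    quantization="awq",               # 4-bit AWQ weights
    tensor_parallel_size=4,           # shard across the four A6000s
    kv_cache_dtype="auto",            # "auto" keeps the cache at the model dtype (BF16)
    max_model_len=131072,             # long context so agents can hold big chunks of the codebase
    gpu_memory_utilization=0.92,      # leave headroom for many concurrent agent sessions
)

params = SamplingParams(temperature=0.2, max_tokens=1024)
out = llm.generate(["// Refactor this 1996-era helper class ..."], params)
print(out[0].outputs[0].text)
```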
Thanks to open datasets, you don't need a massive research lab to build a privacy-preserving AI tool. With the right ingredients, anyone can.
A fantastic new guide shows how the democratization of AI is helping to advance safety. It walks through how to use Google's new fine-tuning API to turn Gemini into a powerful tool for PII anonymization.
This project was powered by two key components:
- An accessible platform from Google.
- High-quality, open-source training data.
We are honored that the author chose the Ai4Privacy pii-masking-200k dataset to provide the crucial data foundation. Our dataset delivered the volume and structure needed to successfully teach a state-of-the-art model how to perform a critical privacy function.
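For readers who want to reproduce the data side, here is a hedged sketch of turning pii-masking-200k into input/output tuning pairs. The column names ("source_text", "target_text") and the JSONL field names are assumptions; check the dataset card and your tuning API's expected format before using this as-is.

```python
# Sketch: convert pii-masking-200k rows into (prompt, masked-output) pairs
# for supervised fine-tuning. Column and field names are assumptions.
import json
from datasets import load_dataset

ds = load_dataset("ai4privacy/pii-masking-200k", split="train")

with open("pii_tuning.jsonl", "w", encoding="utf-8") as f:
    for row in ds:
        pair = {
            "text_input": "Anonymize all PII in the following text:\n" + row["source_text"],
            "output": row["target_text"],  # same text with PII replaced by placeholder tags
        }
        f.write(json.dumps(pair, ensure_ascii=False) + "\n")
```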
This is the future we're working towards: powerful platforms combined with open, safety-focused data to create tools that benefit everyone. Kudos to the author for showcasing what's possible!