Jaward Sesay
Jaward
357 followers · 24 following
https://github.com/Jaykef
JawardSesay_
Jaykef
AI & ML interests
Building Lectūra Labs | CS Grad Student @BIT | AI/ML Research: Autonomous Agents, LLMs | Building The Cursor for Learning | Role Model Karpathy
Recent Activity
posted an update about 3 hours ago
The Kimi team dropped a major improvement to the transformer architecture, and it quietly targets one of the most taken-for-granted components: residual connections. Since their introduction nearly a decade ago, transformers have relied on residuals that simply add all previous layer outputs with equal weight. It works, but it's also kind of... dumb. Kimi's new paper, "Attention Residuals (AttnRes)", replaces that with something much more intelligent:
→ instead of blindly summing past layers,
→ it learns which layers matter,
→ and dynamically weights their contributions across depth.
So attention is no longer just over tokens... it's now also over layers (depth). This effectively turns depth into a dynamic memory system. Phenomenal!
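To make the contrast in the post concrete, here is a minimal sketch in plain Python. It is not the formulation from the AttnRes paper: the function names, the single `query` vector, and the scaled dot-product scoring are all assumptions chosen for illustration. It only shows the general idea of replacing an equal-weight sum over earlier layer outputs with a softmax-weighted mix over depth.

```python
import math

def uniform_residual(layer_outputs):
    """Standard residual stream: every earlier output is added with weight 1."""
    dim = len(layer_outputs[0])
    return [sum(h[j] for h in layer_outputs) for j in range(dim)]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def depth_attention(layer_outputs, query):
    """Hypothetical attention over depth (an assumption, not the paper's method).

    Each earlier layer output h_i is scored as (query . h_i) / sqrt(d),
    the scores are normalized with softmax, and the outputs are mixed
    using those weights instead of being summed equally.
    """
    dim = len(query)
    scores = [
        sum(q * x for q, x in zip(query, h)) / math.sqrt(dim)
        for h in layer_outputs
    ]
    weights = softmax(scores)
    mixed = [
        sum(w * h[j] for w, h in zip(weights, layer_outputs))
        for j in range(dim)
    ]
    return mixed, weights

# Toy outputs of three earlier layers (made-up values, 2-dimensional).
outputs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 0.0]  # stands in for a learned, per-layer query vector

print(uniform_residual(outputs))  # → [2.0, 2.0]
mixed, weights = depth_attention(outputs, query)
print(weights)  # layers aligned with the query get larger weights
```

In a real model the query would be learned (or computed from the current hidden state), so the weights over depth could differ per layer and per token; the fixed `query` here is only a stand-in.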
upvoted a paper about 1 month ago
EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models
posted an update about 2 months ago
Data supporting the findings of our new work on personalized embodied teaching/learning is out; paper coming soon. https://huggingface.co/datasets/Jaward/lectura-agents-data
Jaward's models (5)
Jaward/afri-aya-vision-krio-8b • Text Generation • 9B • Updated Oct 6, 2025 • 2 • 1
Jaward/CodeOptimus-Instruct-Mistral-7B-v0.1.gguf • 7B • Updated Mar 13, 2025 • 13 • 1
Jaward/smollm2_360m_grpo_gsm8k_reasoner • Text Generation • 0.4B • Updated Mar 4, 2025 • 2 • 1
Jaward/phi-3-mini-4k-instruct.Q4_0.gguf • Text Generation • 4B • Updated Apr 27, 2024 • 420 • 3
Jaward/mlx-bge-small-en • Updated Apr 17, 2024 • 3