πππ The largest ever dataset of co-folded 3D protein-ligand structures just dropped on HF!!
Meet SAIR (Structurally Augmented ICβ β Repository): 5M+ AI-generated complexes with experimentally measured drug potency data from SandboxAQ. πππ
Snooping on HF is the best because sometimes you just discover that someone (in this case, Earth Species Project) is about to drop terabytes of sick (high quality animal sounds) data...
Just dropped two bigger physics datasets (both on photonics)!
NUMBA 1: SIB-CL This dataset of Surrogate- and Invariance-Boosted Contrastive Learning (SIB-CL) datasets for two scientific problems: - PhC2D: 2D photonic crystal density-of-states (DOS) and bandstructure data. - TISE: 3D time-independent SchrΓΆdinger equation eigenvalue and eigenvector solutions.
NUMBA2: 2D Photonic Topology Symmetry-driven analysis of 2D photonic crystals: 10k random unit cells across 11 symmetries, 2 polarizations, 5 contrasts. Includes time-reversal breaking cases for 4 symmetries at high contrast.
Always surprised that so few people actually read the FineTasks blog, on β¨how to select training evals with the highest signalβ¨
If you're serious about training models without wasting compute on shitty runs, you absolutely should read it!!
An high signal eval actually tells you precisely, during training, how wel & what your model is learning, allowing you to discard the bad runs/bad samplings/...!
The blog covers in depth prompt choice, metrics, dataset, across languages/capabilities, and my fave section is "which properties should evals have"π (to know on your use case how to select the best evals for you)
If you've followed the progress of robotics in the past 18 months, you've likely noticed how robotics is increasingly becoming the next frontier that AI will unlock.
At Hugging Faceβin robotics and across all AI fieldsβwe believe in a future where AI and robots are open-source, transparent, and affordable; community-built and safe; hackable and fun. We've had so much mutual understanding and passion working with the Pollen Robotics team over the past year that we decided to join forces!
You can already find our open-source humanoid robot platform Reachy 2 on the Pollen website and the Pollen community and people here on the hub at pollen-robotics
We're so excited to build and share more open-source robots with the world in the coming months!