Databricks

company

Verified

https://www.databricks.com

databricks

mylesbaker

in databricks/databricks-dolly-15k 11 months ago

Your employees were clearly bored

#18 opened 11 months ago by

whzhan

authored a paper over 1 year ago

Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

Paper • 2410.04612 • Published Oct 6, 2024

kartikmosaicml

authored a paper over 1 year ago

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published May 30, 2024 • 24

whzhan

authored 5 papers over 1 year ago

Provably Efficient CVaR RL in Low-rank MDPs

Paper • 2311.11965 • Published Nov 20, 2023

REBEL: Reinforcement Learning via Regressing Relative Rewards

Paper • 2404.16767 • Published Apr 25, 2024 • 2

Provable Offline Preference-Based Reinforcement Learning

Paper • 2305.14816 • Published May 24, 2023

Provable Reward-Agnostic Preference-Based Reinforcement Learning

Paper • 2305.18505 • Published May 29, 2023

Dataset Reset Policy Optimization for RLHF

Paper • 2404.08495 • Published Apr 12, 2024 • 9

abhi-db

updated a Space almost 2 years ago

README

seankski

authored a paper almost 2 years ago

Towards Characterizing Domain Counterfactuals For Invertible Latent Causal Models

Paper • 2306.11281 • Published Jun 20, 2023

seankski

authored a paper about 2 years ago

StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments

Paper • 2401.04290 • Published Jan 9, 2024 • 3

seankski

authored 2 papers over 2 years ago

Feature Shift Detection: Localizing Which Features Have Shifted via Conditional Distribution Tests

Paper • 2107.06929 • Published Jul 14, 2021

Towards Explaining Distribution Shifts

Paper • 2210.10275 • Published Oct 19, 2022

matthayes

updated a dataset over 2 years ago

databricks/databricks-dolly-15k

Viewer • Updated Jun 30, 2023 • 15k rows • 20.9k downloads • 903 likes

Skylion007

authored a paper over 2 years ago

Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second

Paper • 2306.07552 • Published Jun 13, 2023 • 3