Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zhiwei's picture

zhiwei

dtxw

AI & ML interests

None yet

Organizations

None yet

Collections 2

LLM
  • Large Language Models as Optimizers

    Paper • 2309.03409 • Published Sep 7, 2023 • 78
  • DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

    Paper • 2309.03883 • Published Sep 7, 2023 • 35
  • Fine-Tuning Language Models with Just Forward Passes

    Paper • 2305.17333 • Published May 27, 2023 • 3
  • E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

    Paper • 2401.06951 • Published Jan 13, 2024 • 26
RLHF
  • Efficient RLHF: Reducing the Memory Usage of PPO

    Paper • 2309.00754 • Published Sep 1, 2023 • 15
LLM
  • Large Language Models as Optimizers

    Paper • 2309.03409 • Published Sep 7, 2023 • 78
  • DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

    Paper • 2309.03883 • Published Sep 7, 2023 • 35
  • Fine-Tuning Language Models with Just Forward Passes

    Paper • 2305.17333 • Published May 27, 2023 • 3
  • E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

    Paper • 2401.06951 • Published Jan 13, 2024 • 26
RLHF
  • Efficient RLHF: Reducing the Memory Usage of PPO

    Paper • 2309.00754 • Published Sep 1, 2023 • 15

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs