guo1006's Collections

multimodel

updated Sep 17, 2025

  • guo1006/layoutlmv2-base-uncased-finetuned-docvqa_1200_examples

    Document Question Answering • 0.2B • Updated Sep 16, 2025 • 7

  • guo1006/git-base-pokemon-captioning-generate

    Image-to-Text • 0.2B • Updated Sep 16, 2025 • 11

  • guo1006/vilt-b32-mlm-finetuned-vqa-800

    Visual Question Answering • 0.1B • Updated Sep 17, 2025 • 3

  • guo1006/speecht5-finetuned-voxpopuli_nl

    Text-to-Audio • 0.1B • Updated Sep 17, 2025 • 4