Unified Model Records

Vicuna

Type: family

Tags: opensource Publisher: LMSYS ORG Released: 2023-03-30 v.1.0.0

Metadata

General information.

name
Vicuna
version
1.0.0
publisher
LMSYS ORG
model type
Language Model
release date
2023-03-30

Relations

No relations specified.

Relationship Graph

Relationship Graph for vicuna

Intended Use

  • Research and academic purposes
  • Open-source innovation
  • Enhancement of chatbot technology
  • Public demonstration and education
  • Non-commercial use

Factors

  • Model's ability to understand and generate human-like responses
  • Performance comparison with other LLM-based models like ChatGPT and Alpaca
  • The model's handling of multi-turn conversations
  • Capability to be fine-tuned on specific use-cases or datasets
  • Cost-effectiveness of training and deploying the model

Evaluation Data

  • Use of GPT-4 for preliminary evaluation and chatbot performance assessment
  • Collection of 70K user-shared ChatGPT conversations from ShareGPT.com
  • Creation of 80 diverse questions for model quality evaluation
  • Comparison between Vicuna, LLaMA, Alpaca, ChatGPT, and Bard
  • Assessment based on categories like Fermi problems, roleplay scenarios, and coding/math tasks

Training Data

  • Approximately 70K user-shared conversations from ShareGPT.com
  • Data converted from HTML to markdown and filtered for quality
  • Long conversations segmented into smaller parts to fit model's context length
  • Enhanced training scripts from Stanford Alpaca project
  • Publicly available datasets totaling 1T tokens for initial LLaMA model

Additional Information

  • The model is built by fine-tuning LLaMA with conversational data
  • Designed to generate more detailed and structured responses
  • Training cost was significantly reduced by using managed spot instances
  • A lightweight distributed serving system implemented for model deployment
  • Not all chatbots' limitations, such as reasoning or mathematics, can be addressed

Recommendations

  • Best suited for research, academic, and open-source development purposes
  • Use in conjunction with moderation APIs to address potential safety concerns
  • Further fine-tuning with specific datasets for targeted applications encouraged
  • Continuous evaluation and comparison with emerging models for improvements
  • Not intended for direct commercial use without proper evaluation