- name
- Vicuna
- version
- 1.0.0
- publisher
- LMSYS ORG
- model type
- Language Model
- release date
- 2023-03-30
Vicuna
Type: family
Tags: opensource
Publisher: LMSYS ORG
Released: 2023-03-30
v.1.0.0
Metadata
General information.
Relations
No relations specified.
Relationship Graph
Intended Use
- Research and academic purposes
- Open-source innovation
- Enhancement of chatbot technology
- Public demonstration and education
- Non-commercial use
Factors
- Model's ability to understand and generate human-like responses
- Performance comparison with other LLM-based models like ChatGPT and Alpaca
- The model's handling of multi-turn conversations
- Capability to be fine-tuned on specific use-cases or datasets
- Cost-effectiveness of training and deploying the model
Evaluation Data
- Use of GPT-4 for preliminary evaluation and chatbot performance assessment
- Collection of 70K user-shared ChatGPT conversations from ShareGPT.com
- Creation of 80 diverse questions for model quality evaluation
- Comparison between Vicuna, LLaMA, Alpaca, ChatGPT, and Bard
- Assessment based on categories like Fermi problems, roleplay scenarios, and coding/math tasks
Training Data
- Approximately 70K user-shared conversations from ShareGPT.com
- Data converted from HTML to markdown and filtered for quality
- Long conversations segmented into smaller parts to fit model's context length
- Enhanced training scripts from Stanford Alpaca project
- Publicly available datasets totaling 1T tokens for initial LLaMA model
Additional Information
- The model is built by fine-tuning LLaMA with conversational data
- Designed to generate more detailed and structured responses
- Training cost was significantly reduced by using managed spot instances
- A lightweight distributed serving system implemented for model deployment
- Not all chatbots' limitations, such as reasoning or mathematics, can be addressed
Recommendations
- Best suited for research, academic, and open-source development purposes
- Use in conjunction with moderation APIs to address potential safety concerns
- Further fine-tuning with specific datasets for targeted applications encouraged
- Continuous evaluation and comparison with emerging models for improvements
- Not intended for direct commercial use without proper evaluation