Unified Model Records

Vicuna

Type: family

Tags: opensource Publisher: LMSYS ORG Released: 2023-03-30 v.1.0.0

Metadata

General information.

name: Vicuna
version: 1.0.0
publisher: LMSYS ORG
model type: Language Model
release date: 2023-03-30

Relations

No relations specified.

Relationship Graph

Relationship Graph for vicuna

Intended Use

Research and academic purposes
Open-source innovation
Enhancement of chatbot technology
Public demonstration and education
Non-commercial use

Factors

Model's ability to understand and generate human-like responses
Performance comparison with other LLM-based models like ChatGPT and Alpaca
The model's handling of multi-turn conversations
Capability to be fine-tuned on specific use-cases or datasets
Cost-effectiveness of training and deploying the model

Evaluation Data

Use of GPT-4 for preliminary evaluation and chatbot performance assessment
Collection of 70K user-shared ChatGPT conversations from ShareGPT.com
Creation of 80 diverse questions for model quality evaluation
Comparison between Vicuna, LLaMA, Alpaca, ChatGPT, and Bard
Assessment based on categories like Fermi problems, roleplay scenarios, and coding/math tasks

Training Data

Approximately 70K user-shared conversations from ShareGPT.com
Data converted from HTML to markdown and filtered for quality
Long conversations segmented into smaller parts to fit model's context length
Enhanced training scripts from Stanford Alpaca project
Publicly available datasets totaling 1T tokens for initial LLaMA model

Additional Information

The model is built by fine-tuning LLaMA with conversational data
Designed to generate more detailed and structured responses
Training cost was significantly reduced by using managed spot instances
A lightweight distributed serving system implemented for model deployment
Not all chatbots' limitations, such as reasoning or mathematics, can be addressed

Recommendations

Best suited for research, academic, and open-source development purposes
Use in conjunction with moderation APIs to address potential safety concerns
Further fine-tuning with specific datasets for targeted applications encouraged
Continuous evaluation and comparison with emerging models for improvements
Not intended for direct commercial use without proper evaluation