
mistralai-Mixtral-8x7B-Instruct-v01

Overview

The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts model. Mixtral-8x7B outperforms Llama 2 70B on most benchmarks, with 6x faster inference.

Mixtral-8x7B-v0.1 is a decoder-only model with 8 distinct groups of parameters, the "experts". At every layer, for every token, a router network chooses two of these experts to process the token and combines their outputs additively. Mixtral has 46.7B total parameters, but with this technique it uses only 12.9B parameters per token. This lets the model run at the same speed and cost as a 12.9B-parameter model.
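The routing step described above can be sketched in a few lines of plain Python. This is a toy illustration, not Mixtral's actual implementation: the vector size, the random router weights, and the "experts" (which simply scale their input) are all made up for the example, and the choice to compute gate weights as a softmax over only the two selected router scores is an assumption that the text above does not spell out.

```python
import math
import random

NUM_EXPERTS = 8  # number of experts, from the overview
TOP_K = 2        # experts chosen per token, from the overview


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(token, router_weights, experts, top_k=TOP_K):
    """Send one token vector through its top_k experts and sum the gated outputs."""
    # One router score per expert: dot product of the token with that expert's row.
    logits = [sum(w * x for w, x in zip(row, token)) for row in router_weights]
    # Indices of the top_k highest-scoring experts.
    chosen = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Assumption: gate weights are a softmax over the chosen scores only.
    gates = softmax([logits[i] for i in chosen])
    output = [0.0] * len(token)
    for gate, idx in zip(gates, chosen):
        expert_out = experts[idx](token)
        # Additive combination: accumulate each expert's gated output.
        output = [o + gate * e for o, e in zip(output, expert_out)]
    return output, chosen


# Hypothetical toy setup: expert k multiplies the token by (k + 1).
random.seed(0)
DIM = 4
experts = [lambda v, k=k: [(k + 1) * x for x in v] for k in range(NUM_EXPERTS)]
router_weights = [[random.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]
token = [0.5, -1.0, 0.25, 2.0]

mixed, chosen = route_token(token, router_weights, experts)
print("experts used:", chosen)
print("combined output:", mixed)
```

Only the two selected experts are ever called for a given token, which is why the per-token compute tracks the active parameter count rather than the total.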

For full details of this model, please read the release blog post.

Limitations and Biases

The Mixtral-8x7B Instruct model is a quick demonstration that the base model can be easily fine-tuned to achieve compelling performance.

It does not have any moderation mechanisms. We're looking forward to engaging with the community on ways to make the model finely respect guardrails, allowing for deployment in environments requiring moderated outputs.

Sample inputs and outputs

Sample input

{
    "input_data": {
        "input_string": [
            {
                "role": "user",
                "content": "I am going to Paris, what should I see?"
            },
            {
                "role": "assistant",
                "content": "Paris, the capital of France, is known for its stunning architecture, art museums, historical landmarks, and romantic atmosphere. Here are some of the top attractions to see in Paris:\n\n1. The Eiffel Tower: The iconic Eiffel Tower is one of the most recognizable landmarks in the world and offers breathtaking views of the city.\n2. The Louvre Museum: The Louvre is one of the world's largest and most famous museums, housing an impressive collection of art and artifacts, including the Mona Lisa.\n3. Notre-Dame Cathedral: This beautiful cathedral is one of the most famous landmarks in Paris and is known for its Gothic architecture and stunning stained glass windows.\n\nThese are just a few of the many attractions that Paris has to offer. With so much to see and do, it's no wonder that Paris is one of the most popular tourist destinations in the world."
            },
            {
                "role": "user",
                "content": "What is so great about #1?"
            }
        ],
        "parameters": {
            "temperature": 0.6,
            "top_p": 0.9,
            "do_sample": true,
            "max_new_tokens": 200,
            "return_full_text": true
        }
    }
}
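A request body in this shape could be assembled programmatically. The sketch below only builds and serializes the JSON. It does not call any endpoint, and the helper name `build_payload` is hypothetical. The default parameter values are copied from the sample input above.

```python
import json


def build_payload(messages, temperature=0.6, top_p=0.9, max_new_tokens=200):
    """Wrap a list of chat messages in the request shape of the sample input."""
    for message in messages:
        # Each turn in the sample has a string "role" and a string "content".
        if not isinstance(message.get("role"), str) or not isinstance(message.get("content"), str):
            raise ValueError(f"message needs string 'role' and 'content': {message!r}")
    return {
        "input_data": {
            "input_string": list(messages),
            "parameters": {
                "temperature": temperature,
                "top_p": top_p,
                "do_sample": True,
                "max_new_tokens": max_new_tokens,
                "return_full_text": True,
            },
        }
    }


payload = build_payload(
    [{"role": "user", "content": "I am going to Paris, what should I see?"}]
)
body = json.dumps(payload, indent=4)
print(body)
```

The resulting string is what would be sent as the request body to a deployed endpoint. The endpoint URL and authentication depend on the deployment and are not shown here.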

Sample output

{
  "output": "The Eiffel Tower is one of the most iconic landmarks in the world and is a must-see attraction in Paris. Here are a few reasons why the Eiffel Tower is so great:\n\n1. Iconic symbol: The Eiffel Tower is instantly recognizable and is often used as a symbol of Paris and France.\n2. Breathtaking views: The Eiffel Tower offers stunning views of the city, and there are several different viewing platforms to choose from, including the top floor, which is located 906 feet (276 meters) above the ground.\n3. Romantic atmosphere: The Eiffel Tower is often associated with romance and is a popular spot for proposals and weddings.\n4. Historical significance: The Eiffel Tower was built for the 1889 World's Fair and was originally intended to be a temporary structure. However, it has since become a permanent fixture in the Paris"
}
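To continue a conversation, the reply can be read from the `output` field and appended to the message list as an assistant turn. This is a minimal sketch assuming the response is a JSON object shaped like the sample output. The helper name `append_reply` and the shortened response text are illustrative only.

```python
import json


def append_reply(messages, response_text):
    """Return a new message list with the model's reply added as an assistant turn."""
    reply = json.loads(response_text)["output"]
    # Build a new list so the caller's history is not mutated.
    return messages + [{"role": "assistant", "content": reply}]


history = [{"role": "user", "content": "What is so great about #1?"}]
# Shortened stand-in for a response shaped like the sample output.
response_text = '{"output": "The Eiffel Tower is one of the most iconic landmarks in the world."}'

history = append_reply(history, response_text)
print(history[-1]["role"])
print(history[-1]["content"])
```

The sample output above appears to stop mid-sentence, which is what one would expect when generation reaches the `max_new_tokens` limit, so a caller may want to raise that value for longer answers.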

Version: 10

Tags

Featured

SharedComputeCapacityEnabled

hiddenlayerscanned

disable-batch: true

huggingface_model_id: mistralai/Mixtral-8x7B-Instruct-v0.1

inference_compute_allow_list: ['Standard_ND40rs_v2', 'Standard_NC96ads_A100_v4', 'Standard_ND96amsr_A100_v4', 'Standard_ND96asr_v4']

inference_supported_envs: ['vllm']

license: apache-2.0

task: chat-completion

author: Mistral AI

benchmark: quality

View in Studio: https://ml.azure.com/registries/azureml/models/mistralai-Mixtral-8x7B-Instruct-v01/version/10

License: apache-2.0

Properties

SharedComputeCapacityEnabled: True

SHA: 5c79a376139be989ef1838f360bf4f1f256d7aec

inference-min-sku-spec: 40|4|672|256
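The pipe-separated value can be split into named fields for easier checking against a candidate VM size. The field order used below (CPU cores, GPUs, memory in GB, storage in GB) is an assumption that this page does not state, and the `MinSkuSpec` type and `parse_min_sku_spec` helper are hypothetical names for this sketch.

```python
from typing import NamedTuple


class MinSkuSpec(NamedTuple):
    # Assumed field order; the page itself does not document it.
    cpu_cores: int
    gpus: int
    memory_gb: int
    storage_gb: int


def parse_min_sku_spec(spec: str) -> MinSkuSpec:
    """Split a 'a|b|c|d' string into four integer fields."""
    parts = spec.split("|")
    if len(parts) != 4:
        raise ValueError(f"expected 4 pipe-separated fields, got {len(parts)}: {spec!r}")
    return MinSkuSpec(*(int(part) for part in parts))


spec = parse_min_sku_spec("40|4|672|256")
print(spec)
```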

inference-recommended-sku: Standard_ND40rs_v2, Standard_NC96ads_A100_v4, Standard_ND96amsr_A100_v4, Standard_ND96asr_v4

languages: EN
