assets/models/system/Mistral-7B-Instruct-v0-2-cuda-gpu/spec.yaml

$schema: https://azuremlschemas.azureedge.net/latest/model.schema.json name: mistralai-Mistral-7B-Instruct-v0-2-cuda-gpu version: 1 path: ./ tags: foundryLocal: "" license: "apache-2.0" licenseDescription: "This model is provided under the License Terms available at <https://www.apache.org/licenses/LICENSE-2.0.html>." author: Microsoft inputModalities: "text" outputModalities: "text" task: chat-completion maxOutputTokens: 2048 alias: mistral-7b-v0.2 type: custom_model variantInfo: parents: - assetId: azureml://registries/azureml/models/mistralai-Mistral-7B-Instruct-v0-2/versions/6 variantMetadata: modelType: 'ONNX' quantization: ['RTN'] device: 'gpu' executionProvider: 'CUDAExecutionProvider' fileSizeBytes: 4273492459 vRamFootprintBytes: 4273717196

assets/models/system/Mistral-7B-Instruct-v0-2-cuda-gpu/spec.yaml (25 lines of code) (raw):