What prerequisites must be skipped when adding GPU nodes to a managed Kubernetes service on an Azure AKS cluster using Azure CLI?
An NAI administrator has successfully imported a model from Hugging Face and created an endpoint for the model. The endpoint is in the Active state. From within the Endpoint section in NAI, the endpoint has been tested with a Sample Request, the response is accurate, and the Status shows Succeeded.
The administrator has provided the endpoint URL and generated and provided API keys to the developers. However, the developers are having issues connecting to the endpoint. They keep getting 400 Bad Request errors when attempting to prompt the model.
What should the administrator do next to ensure the developers are able to successfully prompt the model?
Which widget within the Endpoint Details page can an administrator verify the memory usage associated with the endpoint?
An administrator needs to spot the busiest credentials at a glance.
Which Dashboard widget provides insight into the most frequently used credentials?
An administrator is deploying the required infrastructure on-premise using NKP on Nutanix to install NAI.
Which type of storage should be created during the deployment process to save the models?
Which deployment type of Nutanix Enterprise AI is supported in Amazon EKS?
An AI/ML administrator has received a message from the cloud platform operations team who manage the underlying compute infrastructure that there may be a resource consumption issue impacting the workload.
The AI/ML administrator isn't aware of any problems reported from the consumers of the Nutanix Enterprise AI system but has noted that additional workloads were placed on the platform recently, as well as the introduction of GPUs.
With the cloud platform team reporting a resource consumption issue, and the consumers of the service not reporting any issues, what steps should the AI/ML administrator take?
Which endpoint attribute displays the number and name of the GPUs per instance?
An administrator logs onto the Nutanix Enterprise AI Dashboard and notices that all current GPU and non-GPU Endpoints are marked as Hibernated. After reviewing the Audit Events section, the administrator notes that no anomalous behavior events have occurred since the day before.
What is the most likely cause as to why all of the current Endpoints are hibernated seemingly at random?
An AI/ML admin is testing access to an endpoint using Open AI compatible clients, but is unable to successfully access the endpoint.
What could be the issue?
An AI/ML administrator is monitoring a Nutanix Enterprise AI cluster and receives an alert that the cluster's health status is Critical.
The administrator logs into the NAI Dashboard and gathers the following information:
The Infrastructure Summary component is marked as Critical (red status).
A system message indicates that their newly added Custom Chatbot service is waiting for available resources to start.
A resource usage summary shows that CPU usage is at or near 100%.
Other services are running but are responding more slowly than usual.
The cluster is currently not configured to automatically add more resources when needed.
Based on the information that the administrator gathered, what is the most appropriate action that the administrator should take to remediate the issue?
An Accounting Department is thrilled with the RAG Application that the Application & Data Science Teams recently rolled out. However, they provided some feedback that sometimes (approximately 20% of the time), the documents retrieved are not relevant to their prompts or are too generic.
During development, there was extensive testing between models to make sure the best possible model was selected. The Accounting Department emphasizes that when the responses use the right documents, the results are very good and they are pleased with the completeness, accuracy, and coherence of those responses.
What would be a way to address the irrelevant RAG results without having to rebuild the entire workflow?
How should a non-text-generation LLM endpoint be tested?
An administrator is setting up Nutanix Enterprise AI with a custom domain (ai.company.com) and must comply with security policies requiring a valid TLS certificate from the corporate certificate authority (CA).
Which two steps are necessary to complete this configuration successfully? (Choose two.)
An administrator needs to search for an available NAI helm chart version.
Which command should the administrator use?
An administrator is configuring a new endpoint in Nutanix Enterprise AI (NAI) using an NVIDIA NIM model. During the configuration, the administrator notices that the Use GPU checkbox is selected and cannot be modified.
What is the reason the checkbox is locked in this state?
When preparing GPU nodes for a Nutanix Enterprise AI deployment within a GKE environment, how should an administrator treat the installation of the NVIDIA drivers?
An administrator is integrating an application with an endpoint and finds that the application is experiencing high latency.
What action should the administrator take to ensure the lowest latency when creating endpoints?
An administrator attempts to import a pre-trained model into Nutanix Enterprise AI (NAI), but the model fails to appear in the deployment list, and endpoint creation is unavailable.
Which action should the administrator take to troubleshoot the issue?
Nutanix AI officially supports which two LLM packaging formats? (Choose two.)
What is the correct endpoint PATH that is displayed in the NAI Dashboard?
In a new instance of Nutanix Enterprise AI, what task must be completed to create an API key?