Questions tagged [gpu]

64 questions
35
votes
7 answers

Does a server need a GPU?

Do I need a GPU on a text and console only server? No GPU as in no iGPU and dGPU. Im going to be using SSH, so I dont need a display out. Im using Linux, but the OS shouldn't affect the results
tymur999
  • 537
5
votes
1 answer

The GPU usage provided by nvidia-smi command is very different from GPU metrics from guest OS

I'm working on a project that can monitor virtual machines' vgpu usage. The hypervisor is vCenter, we have nvidia A16 cards installed on vCenter hosts, and assigned a16 vGPU to a couple of windows VMs on this host, theses vGPUs are allocated to the…
5
votes
2 answers

GCP does not have enough resources available to fulfill the request for about a month

I've been trying to start my existing GCP VM that has an NVIDIA T4 GPU attached to it, for almost a month at this time. It has been working fine before but now I am constantly getting the error message: The zone '***' does not have enough resources…
masus04
  • 181
  • 1
  • 2
  • 6
5
votes
1 answer

Kubernetes: How can I get which pod schedule GPU?

I have three Nvidia GPUs in my cluster, and so many pods are running in my cluster. How can I find which one of these pods schedule GPU and how many GPUs they schedule? I use this link to enable the GPU schedule in my cluster for Nvidia…
Nader
  • 163
  • 1
  • 9
4
votes
1 answer

GPU Acceleration on a Windows Server without virtualization over RDP

I'm trying to find out if it's possible to run a Windows Server with one GPU which is shared between all RDP clients so that people could create a session on the server start some program with a UI which needs GPU acceleration disconnect afterwards…
4
votes
1 answer

What is the best metric for auto-scaling GPU instances for machine learning inference in the cloud?

We have an API in AWS with a GPU instance that does inference. We have an auto-scaler setup with the minimum and maximum number of instances, but aren’t sure which metric (GPU/CPU usage, RAM usage, average latency, etc) or combination of metrics…
4
votes
0 answers

Make Headless Server Use Hardware Accelaration

I have a headless CentOS 8 Server with an AMD GPU. I want to use hardware acceleration but when I run OpenGL programs with xvfb-run (e.g glxinfo), the system reports I am using software rendering. How can I use hardware acceleration on a headless…
user548851
4
votes
1 answer

Use passthrough GPU in KVM/QEMMU and display in host OS in a window

I'm new to KVM/QEMMU. I have used virtual box to run Windows 10 in a virtual machine on my Arch host system (a laptop with both integrated and discrete GPUs). Being dissatisfied with the video responsiveness of the guest machine I decided to assign…
brett
  • 141
2
votes
0 answers

Quota 'GPUS_ALL_REGIONS' exceeded. Limit: 0.0 globally

I am trying to create the VM instance with NVIDIA K80 GPUs in Asia-East1 so, I requested to increase the quota and team have adjusted the quota. However, when I am trying to create the VM instance by selecting the NVIDIA K80 GPUs with 4 CPUs and…
2
votes
1 answer

How do you disable hardware-accelerated GPU scheduling via the command line in Windows 10/11?

I need to disable, programmatically, hardware-accelerated GPU scheduling in Windows if it's enabled. Searching, I was pointed at the HKLM\SYSTEM\CurrentControlSet\Control\GraphicsDrivers - HwSchMode regkey. This is 2 when HW GPU scheduling is…
Jason Floyd
  • 1,912
2
votes
1 answer

What socket(s) does the aux power for a GPU come from in a PowerEdge T550?

Server: Dell PowerEdge T550 Tower Server PSU: Single, Hot Plug, Non-Redundant Power Supply (1+0), 1100W, Mixed Mode Titanium GPU: NVIDIA A40 Photos: https://www.reddit.com/user/bigboyserver/comments/10jj1fj/dell_poweredge_t550_nvidia_a40/ Problem…
2
votes
1 answer

Pod is stuck in PodInitializing status when an initContainer is OOMKilled

I have the following on-prem Kubernetes environment: OS: Red Hat Enterprise Linux release 8.6 (Ootpa) Kubernetes: 1.23.7 (single-node, build with kubeadm) NVIDIA driver: 515.65.01 nvidia-container-toolkit: 1.10.0-1.x86_64 (rpm) containerd:…
Daigo
  • 393
2
votes
1 answer

GCP: Cannot create any VM with GPU -> No capacity

I subscribed to GCP and received the $300 credits. Then I upgraded my account to "paid account". Next, I increased the limit for multiple VM types with GPU, in multiple regions, and received approval emails. I tried creating a VM with GPU in…
cloud_IaaS
  • 21
  • 2
2
votes
0 answers

Slurm srun cannot allocate ressources for GPUs - Invalid generic resource specification

I am able to launch a job on a GPU server the traditional way (using CPU and MEM as consumables): ~ srun -c 1 --mem 1M -w serverGpu1 hostname serverGpu1 but trying to use the GPUs will give an error: ~ srun -c 1 --mem 1M --gres=gpu:1 hostname srun:…
user324810
  • 121
  • 1
  • 3
2
votes
1 answer

Considerations using consumer class (high-end) GPU in server?

Motivation: First of all, even if I have some knowledge of computer science, software development and server Linux administration, I never looked into a server hardware and I am a total "newbie" to it. Sorry if this question is trivial to most of…
1
2 3 4 5