Modprobe: FATAL: nvidia-uvm module not found in / lib / modules /

I had a problem recently after successfully installing and testing Tensorflow compiled with GPU support.

After restarting the computer, I got the following error message when I tried to start the Tensorflow program:

...
('Extracting', 'MNIST_data/t10k-labels-idx1-ubyte.gz')  
modprobe: FATAL: Module nvidia-uvm not found in directory /lib/modules/4.4.0-34-generic
E tensorflow/stream_executor/cuda/cuda_driver.cc:491] failed call to cuInit: CUDA_ERROR_UNKNOWN
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:140] kernel driver does not appear to be running on this host (caffe-desktop): /proc/driver/nvidia/version does not exist
I tensorflow/core/common_runtime/gpu/gpu_init.cc:92] No GPU devices available on machine.
(0, 114710.45)
(1, 95368.891)
...
(98, 56776.922)
(99, 57289.672)

Screencapture of error

Code: https://github.com/llSourcell/autoencoder_demo

Question: Why does restarting Ubuntu 16.04 interrupt Tensorflow?

+4
source share
2 answers

I really solved my own problem and wanted to share a solution that worked for me.

Google : "modprobe: FATAL: nvidia-uvm /lib/modules/ "

askubuntu: https://askubuntu.com/a/496146

, , , , 404, .

: , Ubuntu, , .

. NVIDIA .

: NVIDIA X

NVIDIA

1) :

askubuntu:

1) (Ctrl + Alt + F1 F6).

2) ( ) sudo ./<DRIVER>.run -K

"": https://askubuntu.com/a/496146

, , "" :

2) SSH:

:

http://ubuntuhandbook.org/index.php/2014/01/boot-into-text-console-ubuntu-linux-14-04/

$ sudo cp -n /etc/default/grub /etc/default/grub.orig
$ sudo nano /etc/default/grub
$ sudo update-grub

grub (3 ):

  • GRUB_CMDLINE_LINUX_DEFAULT = " ", # , Ubuntu.
  • GRUB_CMDLINE_LINUX = " GRUB_CMDLINE_LINUX =" ", Ubuntu .
  • # GRUB_TERMINAL = console, # , Grub Menu - ( )

  • UPDATE: ( Ubuntu 16.04 If $sudo systemctl set-default multi-user.target

$ sudo shutdown -r now
$ sudo service lightdm stop
$ sudo ./<DRIVER>.run

NVIDIA

$ sudo mv /etc/default/grub /etc/default/grub.textonly 
$ sudo mv /etc/default/grub.orig /etc/default/grub 
$ sudo update-grub
$ sudo shutdown -r now

( , GPU )

...
('Extracting', 'MNIST_data/t10k-labels-idx1-ubyte.gz')
I tensorflow/core/common_runtime/gpu/gpu_init.cc:118] Found device 0 with properties: 
name: GeForce GTX 970
major: 5 minor: 2 memoryClockRate (GHz) 1.342
pciBusID 0000:01:00.0
Total memory: 3.94GiB
Free memory: 3.88GiB
I tensorflow/core/common_runtime/gpu/gpu_init.cc:138] DMA: 0 
I tensorflow/core/common_runtime/gpu/gpu_init.cc:148] 0:   Y 
I tensorflow/core/common_runtime/gpu/gpu_device.cc:868] Creating TensorFlow device (/gpu:0) -> (device: 0, name: GeForce GTX 970, pci bus id: 0000:01:00.0)
(0, 113040.92)
(1, 94895.867)
...

Screencapture

+4

: NVIDIA X:

  • ubuntu SSH
  • ( ) Ubuntu
  • sudo,
0

Source: https://habr.com/ru/post/1651703/


All Articles