Shin Tech Notes: June 2019

Saturday, June 22, 2019

Installing TensorFlow GPU with Anaconda

Now I got Ubuntu 19 installed on Ryzen 2700 + RTX 2070 (27 Combo I call). Next step was to install TensorFlow GPU. Ndvia driver should be installed first though.

Here were general steps I did,
1. Install Anaconda. Python 3.7 version I used.
2. Create an environment, $ conda create --name tf-gpu
3. $ source activate tf-gpu
4. $ conda install tensorflow-gpu

This link helped me a lot to install TensorFlow GPU,
https://www.pugetsystems.com/labs/hpc/Install-TensorFlow-with-GPU-Support-the-Easy-Way-on-Ubuntu-18-04-without-installing-CUDA-1170/

Installing Windows 10 and Ubuntu 19 on ASRock X470 Taichi

Motherboard: ASRock X470 Taichi

Drives:
1. Ultra M.2 Socket/PCIe Gen3x4 (Up to 32Gb/s): ADATA XPG SX8200 Pro 1TB, Sequential read/write speed up to 3500/3000 MB/s

2. M.2 Socket/PCIe Gen2x4 (Up to 20Gb/s): Intel SSD 660p Series 1TB, Sequential read/write speed up to 1800/1800 MB/s

OS Installation:
1. Install Windows 10 on ADATA XPG, full format
2. In BIOS, set Intel SSD 1st boot. You must select it in BBS Prioties first.
3. Install Ubuntu 19 on the Intel SSD drive.
4. In BIOS, set ADATA XPG as the 1st boot drive again.
5. Reboot the machine and you get the GRUB OS loader to select Unbuntu or Windows 10.

Comment:
I am not sure why this worked. When I had set ADATA as the 1st boot drive then installed Ubuntu on Intel SSD, then it didn't work.

TensorFlow CUDNN_STATUS_INTERNAL_ERROR

When I was executing code with TensorFlow's nn.conv2d() function. I got a problem and two errors,

E tensorflow/stream_executor/cuda/cuda_dnn.cc:334] Could not create cudnn handle: CUDNN_STATUS_INTERNAL_ERROR

tensorflow.python.framework.errors_impl.UnknownError: Failed to get convolution algorithm. This is probably because cuDNN failed to initialize, so try looking to see if a warning log message was printed above. [[{{node Conv2D}}]]

I searched for the errors and found some other people had the same problem.
Basically, when you run TensorFlow session, you have to set "config.gpu_options.allow_growth = True" for GPU memory management.

https://www.tensorflow.org/guide/using_gpu#allowing_gpu_memory_growth

You have to put these lines in your code,
#
from tensorflow.compat.v1 import ConfigProto
from tensorflow.compat.v1 import InteractiveSession

config = ConfigProto()
config.gpu_options.allow_growth = True
session = InteractiveSession(config=config)
#

By the way, my graphic card is EVGA GeForce RTX 2070 Black.

Saturday, June 1, 2019

Math for Deep Learning

TensorFlow and Keras make a deep learning (DL) programming easier. The supervised DL program’s

basic steps are in general; load data, define neural network model, compile the model, fit or train for loss

function parameters. It’s done in that order and pretty simple with those libraries.

However, logic behind those libraries is real math; Geometry for vector operations, Derivative and

differentiation for loss functions, more specifically, stochastic gradient descent, chaining derivatives,

reverse-mode differentiation, and symbolic differentiation. And statistics of course.

More you understand those math, more deeply you can understand DL and use those libraries more

effectively.

Reserved Indices in Keras Reuters Data Set

When I was checking "Classifying newswires: a multi-class classification example", I came across one question about the offset value of the word index of the Reuter newswires data set when decoding newswire text.
According to the author of "Deep Learning with Python", index 0, 1 and 2 are reserved indices for "padding", "start of sequence", and "unknown".

If you load the Reuter data set,

reuters = keras.datasets.reuters

(train_data, train_labels), (test_data, test_labels) = reuters.load_data()

And get, index:word,

reverse_word_index = dict([(value, key) for (key, value) in word_index.items()])

with open('reserve_word_index.txt', 'w') as re_word_index_file:

for key, val in sorted(reverse_word_index.items()):

re_word_index_file.write(str(key) + ':' + str(val) + '\n')

Open the output file of the run result of above code, I get something like this,

1:the

2:of

3:to

4:in

5:said

...

First thing I noticed, as you see the result, there is no index 0, then index 1 is "the" not "start of sequence", and index 2 is "of" not "unknown".

I printed out the first newswire in the train data to see what's inside,

print(train_data[0])

[1, 27595, 28842, 8, 43, 10, 447, 5, 25, 207, 270, 5, 3095, 111, 16, 369, 186, 90, 67, 7, 89, 5, 19, 102, 6, 19, 124, 15, 90, 67, 84, 22, 482, 26, 7, 48, 4, 49, 8, 864, 39, 209, 154, 6, 151, 6, 83, 11, 15, 22, 155, 11, 15, 7, 48, 9, 4579, 1005, 504, 6, 258, 6, 272, 11, 15, 22, 134, 44, 11, 15, 16, 8, 197, 1245, 90, 67, 52, 29, 209, 30, 32, 132, 6, 109, 15, 17, 12]

Then decoded train_data[0], without -3 off set,

decoded_newswire = ' '.join([reverse_word_index.get(i, '?') for i in train_data[0]])

print(decoded_newswire) 

"the wattie nondiscriminatory mln loss for plc said at only ended said commonwealth could 1 traders now april 0 a after said from 1985 and from foreign 000 april 0 prices its account year a but in this mln home an states earlier and rise and revs vs 000 its 16 vs 000 a but 3 psbr oils several and shareholders and dividend vs 000 its all 4 vs 000 1 mln agreed largely april 0 are 2 states will billion total and against 000 pct dlrs"

Now, get the decoded newswire with offset -3 as the author of Deep Learning with Python said,

decoded_newswire = ' '.join([reverse_word_index.get(i - 3, '?') for i in train_data[0]])

print(decoded_newswire)

"? mcgrath rentcorp said as a result of its december acquisition of space co it expects earnings per share in 1987 of 1 15 to 1 30 dlrs per share up from 70 cts in 1986 the company said pretax net should rise to nine to 10 mln dlrs from six mln dlrs in 1986 and rental operation revenues to 19 to 22 mln dlrs from 12 5 mln dlrs it said cash flow per share this year should be 2 50 to three dlrs reuter 3"

So my conclusion may be that every word in the data set has "padding", "start of sequence", and "unknown", in other word, 3 for those are added in the word index number. For example, 6893 which is "dramatically" in the index_words items are actually stored (encoded?) as 6896 in the data set. When you decode the word from the data set and map with the word index dictionary, you need to decode by offsetting by 3 to get the actual word in a newswire.