TensorFlow is Google's library for machine learning, built first and foremost for neural networks. In this note we go through some basics of the core library, as well as the probability module (TensorFlow Probability).
# pip install -U TensorFlow
import tensorflow as tf
tf.add(1,2)
tf.add([1.,2.], [3, 4])
One thing to note with TensorFlow is that dtypes need to be consistent within a tensor: you can't mix 1 and 1. in the same array, vector, or matrix. (Well, the call above worked for me but not for him, so this likely depends on the TensorFlow version..)
tf.add([1,2], [3,4])
tf.square(123)
tf.constant([[1,3,4],[5,6,7]])
tf.linalg.inv([[1.,2.], [3.,4.]])
TensorFlow defaults to float32, but the dtype can be changed; float16 is typically the fastest on GPUs. Just keep this in mind when dtype errors pop up.
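If a dtype error does pop up, the usual fix is to set dtype= explicitly or to cast; a minimal sketch (not from the lecture):
x = tf.constant([[1., 2.], [3., 4.]], dtype=tf.float16)  # half precision
tf.cast(x, tf.float32)  # cast back to float32 before mixing with float32 tensors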
opt = tf.optimizers.SGD(learning_rate=0.3)
Standard gradient descent takes an objective function, picks a starting point, and follows the steepest descent downhill towards a (local) minimum.
var = tf.Variable(2.0)
Variables are the counterpart to constants. They can also be scalars, vectors or matrices. The only difference is that variables can change, while constants can't.
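For instance, a variable can be updated in place, which a constant cannot (a quick check, not from the lecture; tmp is just a throwaway name):
tmp = tf.Variable(2.0)
tmp.assign(5.0)      # in-place update, only possible on Variables
tmp.assign_add(1.0)  # tmp is now 6.0; tf.constant has no assign method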
with tf.GradientTape() as tape:
    y = var**2 + 1
tape.gradient(y, var)
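Under the hood this gradient is all an SGD step needs; a sketch (not from the lecture) of one hand-rolled update, done on a throwaway copy so var itself stays at 2.0 for the cells below:
w = tf.Variable(var.numpy())  # copy of var, so the cells below are unaffected
lr = 0.3
with tf.GradientTape() as tape:
    y = w**2 + 1
grad = tape.gradient(y, w)  # dy/dw = 2*w
w.assign_sub(lr * grad)     # w <- w - lr * grad, i.e. one SGD step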
loss = lambda : var**2 /1.5
loss()
opt.minimize(loss, [var])
var
var_vals, loss_vals = [], []
for _ in range(30):  # underscore is a valid variable name; it just means "don't care about this value"
    opt.minimize(loss, [var])
    var_vals.append(var.numpy())
    loss_vals.append(loss())
import matplotlib.pyplot as plt
%matplotlib inline
plt.plot(var_vals)
plt.plot(loss_vals)
plt.legend(['var', 'loss'])
It can be nice to use pandas for preprocessing, but in industry it's a good idea to stick to as few libraries as possible. TensorFlow datasets are built for speed, whereas pandas is more for exploring and visualising data.
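If the data does start out in pandas, a DataFrame can be handed straight to from_tensor_slices; a small sketch with made-up column names:
import pandas as pd
df = pd.DataFrame({'x': [1., 2., 3.], 'y': [0, 1, 0]})  # toy DataFrame, columns are hypothetical
for row in tf.data.Dataset.from_tensor_slices(dict(df)).take(2):
    print(row)  # each element is a dict of scalar tensors, one entry per column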
ds = tf.data.Dataset.from_tensor_slices(list(range(12)))
for i in ds.map(tf.square).shuffle(2).batch(3):
    print(i)
We want to do the data manipulations on the graphics card; there is no need to manipulate on the CPU and then transfer the result over to the GPU.
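To keep the pipeline fast, the map and transfer steps can also be parallelised and prefetched; a sketch assuming TF 2.4+, where tf.data.AUTOTUNE is available:
fast_ds = (ds.map(tf.square, num_parallel_calls=tf.data.AUTOTUNE)
             .shuffle(12)
             .batch(3)
             .prefetch(tf.data.AUTOTUNE))  # overlap preprocessing with consumption
for batch in fast_ds:
    print(batch)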
ds.reduce(0, lambda state, value: state+value)
state = 0
op = lambda state, value: state+value
for value in ds:
    state = op(state, value)
state
ds.map(tf.square).reduce(0, tf.add)
We will use this for Bayesian analysis with TensorFlow.
# pip install tensorflow_probability
import tensorflow as tf
import tensorflow_probability as tfp
Random sample from a Binomial with N=60 and $\theta = 0.6$.
tfd = tfp.distributions
dist = tfd.Binomial(total_count=60, probs=0.6) # to pull a sample from some random distribution
sample = dist.sample(1)
sample # happened to match his perfectly, but didn't have to!
thetas = tf.linspace(start=0., stop=1., num=500) # a grid of 500 candidate values of theta in [0, 1]
# one distribution per value of theta ->
dists = tfd.Binomial(total_count=60, probs=thetas) # hypothesis: the data follows a Binomial distribution
probs = dists.prob(sample) # P(data | hypothesis), the likelihood of each theta
sample /60 # sample=38, 60 observations
idx = probs > 0.01
plt.plot(thetas[idx], probs[idx])
plt.xlabel('$\\theta$')
plt.ylabel('likelihood')
None
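As an aside (not in the lecture): with a uniform prior over the theta grid, Bayes' rule says the posterior is just the normalised likelihood, so the curve above is, up to scaling, the posterior itself.
posterior = probs / tf.reduce_sum(probs)  # grid approximation of P(theta | data) under a flat prior
posterior.shape  # (500,), one posterior weight per theta on the grid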
Start by generating categorical samples given my probabilities.
?tf.random.categorical
log_prob = dists.log_prob(sample)
N = 10000 # want 10000 samples from the posterior, drawn in proportion to the likelihood curve over the hypotheses
tf.random.categorical(log_prob, N) # fails: categorical expects 2-D logits of shape [batch, num_classes]
idx = tf.random.categorical([log_prob], N) # wrapping in a list adds the batch dimension
This will return an index (or indices, depending on N) corresponding to the categories.
tf.random.categorical(tf.math.log([[0.3, 0.7]]), 10)
From these indices, I need to get back the corresponding hypotheses (values of $\theta$).
theta_sample = tf.gather(thetas, idx)
plt.hist(theta_sample[0]) # drop the leading batch dimension before plotting
Essentially the same shape as the likelihood curve we got above.
posterior_sample = tfd.Binomial(total_count=60, probs=theta_sample).sample(1)[0,0,:]
plt.hist(posterior_sample)
unique, idxs, counts = tf.unique_with_counts(posterior_sample)
plt.bar(unique, counts)
sample # peak matches sample
Does my original model make sense given these results? Here we can say yes: we again see a peak around 35 and roughly the same distribution.
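To put a number on it, the posterior samples can be summarised directly, e.g. with a mean and a rough 94% credible interval (a sketch using numpy; the 94% level is an arbitrary choice):
import numpy as np
post = theta_sample.numpy().flatten()
post.mean()                   # posterior mean of theta
np.percentile(post, [3, 97])  # rough 94% credible interval for theta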