Module 3.1 - Efficiency¶

Data¶

What is a word?¶

  • Treat each word as an index into a vocabulary
  • Represent it as a one-hot vector

Layer 1¶

  • The first layer is called an embedding

Hidden vector for word¶

Get the word vector:

# One-hot: 1 at the word's index, 0 everywhere else
word_one_hot = tensor([0 if i != word else 1
                       for i in range(VOCAB)])
# Multiply-and-sum selects the word's column of the embedding weights
embedding = (layer1 * word_one_hot).sum(1)
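
A minimal NumPy sketch of the same trick, assuming hypothetical weights layer1 of shape (EMB, VOCAB): the multiply-and-sum is just a column lookup.

import numpy as np

EMB, VOCAB = 3, 5                     # hypothetical sizes
layer1 = np.random.rand(EMB, VOCAB)   # hypothetical embedding weights
word = 2
word_one_hot = np.array([0.0 if i != word else 1.0 for i in range(VOCAB)])

embedding = (layer1 * word_one_hot).sum(1)
assert np.allclose(embedding, layer1[:, word])   # identical to a column lookup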

How does this share information?¶

  • Similar words get similar embedding vectors
  • Dot-product is an easy way to measure similarity (sketched below):
(word_emb1 * word_emb2).sum()
  • Differentiable!
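
A small sketch with made-up embedding values: a larger dot-product means the vectors point in more similar directions.

import numpy as np

# Made-up embeddings purely for illustration
cat = np.array([0.9, 0.1, 0.0])
dog = np.array([0.8, 0.2, 0.1])
car = np.array([-0.7, 0.5, 0.4])

print((cat * dog).sum())   # 0.74  -> similar directions, high similarity
print((cat * car).sum())   # -0.58 -> dissimilar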

Where do these come from?¶

  • Trained as part of a different model
  • Extracted and posted for others to use
  • (Many more details in NLP class)

Full Model¶

Quiz¶

Cult of Efficiency¶

Context¶

  • We now have our own PyTorch-style framework
  • Everything is a wrapper around a small set of core ops
  • Need to make ops fast

Goal¶

Optimize:

  • map
  • zip
  • reduce

Code¶

Example map:

for i in range(len(out)):
    # Convert the flat output position i to a multidimensional index
    count(i, out_shape, out_index)
    # Map the output index to the (possibly broadcast) input index
    broadcast_index(out_index, out_shape, in_shape, in_index)
    # Convert both indices to flat storage positions via strides
    o = index_to_position(out_index, out_strides)
    j = index_to_position(in_index, in_strides)
    # Apply fn elementwise
    out[o] = fn(in_storage[j])

Why are Python (and friends) "slow"?¶

  • Function calls
  • Types
  • Loops

Function Calls¶

  • Function calls are not free (see the micro-benchmark below)
  • Each call checks for args, special keywords, and lists
  • Methods check for overrides and class inheritance
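
A rough micro-benchmark, assuming nothing beyond the standard library: the same arithmetic is noticeably slower when it has to go through a call.

import timeit

def f(x):
    return x + 1

# Same arithmetic, with and without the function call (times in seconds)
print(timeit.timeit("f(1)", globals=globals()))   # pays call overhead each time
print(timeit.timeit("x + 1", setup="x = 1"))      # no function call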

Types¶

Critical code:

out[o] = in_storage[j] + 3
  • Python doesn't know the type of in_storage[j]
  • May need to coerce 3 to a float, or raise an error
  • May even call __add__ or __radd__! (see the sketch below)
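
A small sketch of that dynamic dispatch: when the left operand's __add__ gives up, Python falls back to the right operand's __radd__. The Boxed class here is purely hypothetical.

class Boxed:
    """A purely hypothetical wrapper type."""
    def __init__(self, v):
        self.v = v
    def __radd__(self, other):
        # int.__add__ returns NotImplemented for Boxed, so Python tries this
        return Boxed(other + self.v)

print((3 + Boxed(4)).v)   # 7, after the interpreter tried two methods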

Loops¶

  • Python loops are always run exactly as written (see the sketch below)
  • The interpreter can't fuse similar loops or hoist out constant computation
  • Very hard to run anything in parallel
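
A hypothetical example: a compiler would hoist the constant math.sqrt(2.0) out of the loop, but the interpreter re-evaluates it every time through.

import math

out = [0.0] * 1000
for i in range(1000):
    # math.sqrt(2.0) never changes, but Python recomputes it
    # (attribute lookup + call) on every single iteration
    out[i] = i * math.sqrt(2.0)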

Notebook¶

Colab Notebook

Other¶

Many other slow things...

  • Lists
  • Classes
  • Magic of all kinds

Fast Math¶

Numba¶

  • Python library for speeding up numerical Python code
  • API: higher-order functions that produce fast mathematical code
  • Numba: https://numba.pydata.org

How does it work?¶

Wrap a function with the JIT:

import numba

def my_code(x, y):
    for i in range(100):
        x[i] = y + 20

my_code(x, y)                          # plain Python: interpreted every time
fast_my_code = numba.njit()(my_code)   # wrap the function with the JIT
fast_my_code(x, y)                     # first call compiles, then runs
fast_my_code(x, y)                     # later calls reuse the compiled code

Notebook¶

Colab Notebook

Terminology : JIT Compiler¶

  • Just-in-time
  • Waits until you call a function to compile it
  • Specializes code based on the argument types given (see the sketch below)
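
A small sketch of that specialization, assuming Numba is installed: each new argument type triggers a fresh compilation, visible in the dispatcher's recorded signatures.

import numba

@numba.njit
def add(a, b):
    return a + b

add(1, 2)        # first call: compiles an integer specialization
add(1.0, 2.0)    # new argument types: compiles a float specialization
print(add.signatures)   # two recorded signatures, one per type combination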

Terminology : LLVM¶

  • Underlying compiler framework to generate code
  • Used by many different languages (C++, Swift, Rust, ...)
  • Generates efficient machine code for the system

What do we lose?¶

  • njit will fail for many Python operations (see the sketch below)
  • No lists, classes, or arbitrary Python functions allowed
  • Calling with different argument types causes recompilation
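
A hypothetical failure case: passing an instance of a plain Python class into an njit function raises a typing error when compilation is attempted.

import numba

class Point:                 # an ordinary Python class
    def __init__(self, x):
        self.x = x

@numba.njit
def get_x(p):
    return p.x

try:
    get_x(Point(1.0))        # nopython mode cannot type a plain Python object
except Exception as err:     # Numba raises a TypingError here
    print(type(err).__name__)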

Strategy¶

  • Use Python for general operations
  • Use Numba for the core tensor ops
  • Allow users to add new Numba functions

Parallel¶

Parallel¶

  • Run code on multiple threads
  • Particularly suited for map / zip
  • Baby steps towards GPU

Parallel Range¶

  • Replace for loops with a parallel version
  • Tells the compiler it can run iterations in any order
  • Be careful! Ideally the loop body has no cross-iteration dependencies

Code Transformation¶

Transform:

import numba
from numba import prange

def my_code(x, y):
    for i in prange(100):    # parallel range: iterations may run in any order
        x[i] = y + 20

my_code(x, y)                # prange acts like range outside the JIT
fast_my_code = numba.njit(parallel=True)(my_code)   # enable the threaded backend
fast_my_code(x, y)           # first call compiles the parallel version
fast_my_code(x, y)

Nondeterminism¶

  • No guarantee on ordering
  • Need to be careful with reductions (see the sketch below)
  • Speedups will depend on the system
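
A sketch of the reduction pitfall, assuming Numba is installed: Numba recognizes the scalar accumulation pattern and makes it safe, while the explicit shared write is a data race.

import numpy as np
from numba import njit, prange

@njit(parallel=True)
def safe_sum(x):
    total = 0.0
    for i in prange(len(x)):
        total += x[i]    # recognized reduction: per-thread partials are combined
    return total

@njit(parallel=True)
def racy_sum(x, out):
    for i in prange(len(x)):
        out[0] += x[i]   # shared write across iterations: a data race

print(safe_sum(np.ones(1000)))   # reliably 1000.0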

Parallel Bugs¶

  • Warning! Nasty bugs
  • Tests failing randomly
  • Crashes due to out-of-bounds

Parallel Diagnostics¶

  • Numba can report diagnostics on how it parallelized a function (see below)
  • Useful to see if you are actually getting any benefit
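
A small sketch using Numba's documented parallel_diagnostics API; the function must run once so there is a compiled version to report on.

import numpy as np
from numba import njit, prange

@njit(parallel=True)
def scale(x):
    out = np.empty_like(x)
    for i in prange(len(x)):
        out[i] = 2.0 * x[i]
    return out

scale(np.ones(100))                   # run once so a compiled version exists
scale.parallel_diagnostics(level=2)   # report which loops were parallelized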