k1lib

PyTorch is awesome, and it provides a very effective way to execute ML code fast. What it lacks is the surrounding infrastructure that makes the general debugging and discovery process better. Other, more official wrapper frameworks don't quite make sense to me, so this is an attempt at creating a robust suite of tools that does.


Let's see an example:

Overview

k1lib.imports is just a file that imports lots of common utilities, so that importing stuff is easier and quicker.
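In practice that usually means a single star import at the top of your notebook (exactly what gets re-exported depends on your k1lib version):

from k1lib.imports import *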

Here is our network. Just a normal feed-forward network, with skip blocks in the middle.
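The exact architecture doesn't matter here; a minimal PyTorch sketch of that idea (all names below are made up for illustration) would be something like:

import torch.nn as nn

class SkipBlock(nn.Module):
    """A block whose output gets added back to its input (a skip connection)."""
    def __init__(self, features):
        super().__init__()
        self.linear = nn.Linear(features, features); self.relu = nn.ReLU()
    def forward(self, x): return x + self.relu(self.linear(x))

class Net(nn.Module):
    """A normal feed-forward network, with skip blocks in the middle."""
    def __init__(self, inF=10, hidden=32, outF=1):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(inF, hidden), nn.ReLU(),
            SkipBlock(hidden), SkipBlock(hidden),
            nn.Linear(hidden, outF))
    def forward(self, x): return self.layers(x)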

Here is where things get a little more interesting. k1lib.Learner is the main wrapper where training will take place. It has 4 basic parameters that must be set before training: model, data loader, optimizer, and loss function.
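Setting it up could look roughly like this (l.model, l.data, l.opt, l.lossF and l.run() are assumptions based on the description above; check the k1lib.Learner docs for the exact attribute names):

import torch, torch.nn as nn, k1lib

l = k1lib.Learner()
l.model = Net()                                           # the network from above
l.data = ...                                              # train/valid data loaders (see the Data loader section)
l.opt = torch.optim.Adam(l.model.parameters(), lr=3e-3)
l.lossF = nn.MSELoss()
l.run(1)                                                  # assumed: train for 1 epoch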

Tip: docs are tailored for each object, so you can do print(obj) or just obj in a code cell to see them.

There are lots of Callbacks. What they are will be discussed later, but here's a tour of a few of them:

ParamFinder

As advertised, this callback searches for a perfect parameter for the network.

Loss

The returned data type is a k1lib.viz.SliceablePlot, so you can zoom the plot into a specific range, like this:

Notice how the original train range is [0, 250] and the valid range is [0, 60]. When sliced with [120:], the train range is sliced as expected from the middle to the end, while the valid range adapts and is also sliced from the middle to the end ([30:]).
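Assuming plot holds the returned SliceablePlot, that zoom is literally just indexing:

plot[120:]   # train now shows [120, 250], valid adapts to roughly [30, 60]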

LossLandscape

Oh, and yeah, this callback can give you a quick view of what the loss landscape looks like. The center point (0, 0) is always at the lowest portion of the landscape, so that tells us the network has learned something.

HookParam

This tracks the parameters' means, stds, mins and maxs while training. You can also display only a certain number of parameters:

HookModule

Pretty much the same thing as before. This callback hooks into selected modules and captures the forward and backward passes. Both HookParam and HookModule only hook into selected modules (by default, all of them are selected):

CSS module selector

You can select specific modules by setting l.css = ..., kinda like this:
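The exact selector syntax is documented in k1lib.selector; purely as a hypothetical illustration of the shape of it, using the props mentioned below:

l.css = "Linear: HookModule"   # hypothetical: select every Linear module and give it the "HookModule" prop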

Essentially, you can attach props to whichever modules and parameters you select, and different callbacks will recognize certain props. HookModule will hook all modules with the props "all" or "HookModule". Likewise, HookParam will hook all parameters with the props "all" or "HookParam".

Data loader

It's simple, really! l.data contains a train and a valid data loader, and each has multiple ways to unpack values.
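For example (a sketch, assuming the loaders yield (x, y) batches, which is one of the ways to unpack them):

for xb, yb in l.data.train:     # l.data.valid works the same way
    print(xb.shape, yb.shape)
    break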

Callbacks

Let's look at l again:

l.model and l.opt are simple enough; they're just PyTorch primitives. Where most of the magic lies is in l.cbs, an object of type k1lib.Callbacks, a container for k1lib.Callback objects. Notice the final "s" in the name.

A callback is pretty simple. While training, you may want to insert functionality here and there. Let's say you want the program to print out a progress bar while training. You can edit the learning loop directly, with some internal variables to keep track of the current epoch and batch, like this:

import time

startTime = time.time()
for epoch in range(epochs):
    for batch in range(batches):
        # do training
        data = getData()
        train(data)

        # calculate progress
        elapsedTime = time.time() - startTime
        progress = round((batch / batches + epoch) / epochs * 100)
        print(f"\rProgress: {progress}%, elapsed: {round(elapsedTime, 2)}s         ", end="")

But this means when you don't want that functionality anymore, you have to know what internal variable belongs to the progress bar, and you have to delete it. With callbacks, things work a little bit differently:

import k1lib

class ProgressBar(k1lib.Callback):
    def startRun(self):
        pass
    def startBatch(self):
        self.progress = round((self.batch / self.batches + self.epoch) / self.epochs * 100)
        a = f"Progress: {self.progress}%"
        b = f"epoch: {self.epoch}/{self.epochs}"
        c = f"batch: {self.batch}/{self.batches}"
        print(f"{a}, {b}, {c}")

class Learner:
    def run(self):
        self.epochs = 1; self.batches = 10

        self.cbs = k1lib.Callbacks()
        self.cbs.append(ProgressBar())

        self.cbs("startRun")
        for self.epoch in range(self.epochs):
            self.cbs("startEpoch")
            for self.batch in range(self.batches):
                self.xb, self.yb = getData()
                self.cbs("startBatch")

                # do training
                self.y = self.model(self.xb); self.cbs("endPass")
                self.loss = self.lossF(self.y, self.yb); self.cbs("endLoss")
                if self.cbs("startBackward"): self.loss.backward()

                self.cbs("endBatch")
            self.cbs("endEpoch")
        self.cbs("endRun")

This is a stripped-down version of k1lib.Learner, just to get the idea across. The point is, whenever you do self.cbs("startRun"), it runs through all the k1lib.Callback objects it has (ProgressBar in this example), checks whether each one implements startRun, and if so, executes it.

Inside ProgressBar's startBatch, you can access the learner's current epoch by doing self.learner.epoch. But you can also just do self.epoch. If the attribute is not defined on the callback, it is automatically looked up on self.learner.
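Both tricks are easy to sketch. This is not k1lib's actual implementation, just the idea behind it:

class Callback:
    def __getattr__(self, name):
        # only called when the attribute isn't found on the callback itself;
        # fall back to the learner, so self.epoch behaves like self.learner.epoch
        if name == "learner": raise AttributeError(name)
        return getattr(self.learner, name)

class Callbacks:
    def __init__(self, learner):
        self.learner = learner; self.cbs = []
    def append(self, cb):
        cb.learner = self.learner; self.cbs.append(cb)
    def __call__(self, eventName):
        # run every callback that actually defines this event
        for cb in self.cbs:
            method = getattr(type(cb), eventName, None)
            if method is not None: method(cb)
        # (the real Callbacks also uses the return value, so callbacks can gate
        # steps like loss.backward(); that part is omitted here)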

As you can see, if you want to get rid of the progress bar without using k1lib.Callbacks, you have to delete the startTime line and the actual calculate-progress lines. This requires you to remember which lines belong to which functionality. If you use the k1lib.Callbacks mechanism instead, then you can just comment out self.cbs.append(ProgressBar()), and that's it. This makes swapping out components extremely easy, repeatable, and beautiful.

Other use cases include intercepting at startBatch and pushing all the training data to the GPU. You can also reshape the data however you want, insert different loss mechanisms (endLoss) in addition to lossF, or quickly inspect the model output. You can also change learning rates while training (startEpoch) according to some schedule. The possibilities are practically endless.
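For instance, a hand-rolled callback along these lines (k1lib may well ship something like it already; this is just to show the mechanism) could push each batch to the GPU:

class ToGPU(k1lib.Callback):
    """Moves the current batch to the GPU right before it's trained on."""
    def startBatch(self):
        self.learner.xb = self.learner.xb.cuda()
        self.learner.yb = self.learner.yb.cuda()

l.cbs.append(ToGPU())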