Commit Graph

15 Commits

Author SHA1 Message Date
Andrea Pedrotti 65407f51fa update trainer to handle mT5 2023-03-15 11:47:17 +01:00
Andrea Pedrotti a3e183d7fc avoid duplicating model on gpu when earlystop is triggered 2023-03-14 11:22:00 +01:00
Andrea Pedrotti 57918ec523 save and load datasets as pkl 2023-03-10 12:40:26 +01:00
andreapdr 7d0d6ba1f6 log average metrics via wandb 2023-03-10 11:21:33 +01:00
andreapdr 5ef0904e0e logging average metrics 2023-03-09 17:59:18 +01:00
andreapdr 7e1ec46ebd improved wandb logging 2023-03-09 17:03:17 +01:00
Andrea Pedrotti 84dd1f093e logging via wandb 2023-03-07 17:34:25 +01:00
andreapdr 7dead90271 logging via wandb 2023-03-07 14:20:56 +01:00
Andrea Pedrotti 0c9454cdd4 implemented multimodal pipeline; gFunDataset interface; fixed imports 2023-03-02 18:16:46 +01:00
Andrea Pedrotti 7041f7b651 fixed bug: we were applying sigmoid function 2 times when training the Attention-based aggregator 2023-02-14 14:28:17 +01:00
Andrea Pedrotti 7ed98346a5 fixed loading function for Attention-based aggregating function when triggered by EarlyStopper 2023-02-13 15:01:50 +01:00
Andrea Pedrotti 13ada46c34 attention-based aggregation function, first implementation, some hard-coded parameters 2023-02-10 18:29:58 +01:00
Andrea Pedrotti 3f3e4982e4 model checkpoint during training. Restore best model if earlystop is triggered 2023-02-10 11:37:32 +01:00
Andrea Pedrotti dba2ed9c9c Visual Transformer VGF 2023-02-09 16:55:06 +01:00
Andrea Pedrotti 6b75483b55 bulk upload after refactoring 2023-02-07 18:40:17 +01:00