Andrea Pedrotti
|
b3b7c69263
|
updated get_config of vgfs + restore model fn for mt5
|
2023-06-12 12:11:38 +02:00 |
Andrea Pedrotti
|
ab7a310b34
|
todo updates
|
2023-03-17 10:44:45 +01:00 |
Andrea Pedrotti
|
ee38bcda10
|
fixed TransformerGen init
|
2023-03-16 12:12:39 +01:00 |
Andrea Pedrotti
|
b34da419d0
|
fixed import
|
2023-03-16 11:49:49 +01:00 |
Andrea Pedrotti
|
17d0003e48
|
getter for gFun and VGFs config
|
2023-03-16 11:41:40 +01:00 |
Andrea Pedrotti
|
9d43ebb23b
|
implemented save/load for MT5ForSequenceClassification. Moved torch Datasets to datamanager module
|
2023-03-16 10:31:34 +01:00 |
andreapdr
|
7e1ec46ebd
|
improved wandb logging
|
2023-03-09 17:03:17 +01:00 |
andreapdr
|
7dead90271
|
logging via wandb
|
2023-03-07 14:20:56 +01:00 |
Andrea Pedrotti
|
0c9454cdd4
|
implemented multimodal pipeline; gFunDataset interface; fixed imports
|
2023-03-02 18:16:46 +01:00 |
Andrea Pedrotti
|
7ed98346a5
|
fixed loading function for Attention-based aggregating function when triggered by EarlyStopper
|
2023-02-13 15:01:50 +01:00 |
Andrea Pedrotti
|
13ada46c34
|
attention-based aggregation function, first implementation, some hard-coded parameters
|
2023-02-10 18:29:58 +01:00 |
Andrea Pedrotti
|
2a42b21ac9
|
concat aggfunc
|
2023-02-10 12:58:26 +01:00 |
Andrea Pedrotti
|
3f3e4982e4
|
model checkpoint during training. Restore best model if earlystop is triggered
|
2023-02-10 11:37:32 +01:00 |
Andrea Pedrotti
|
9c2c43dafb
|
Visual VGF + MultiNewsDataset, working from data loading to testing
|
2023-02-09 18:42:27 +01:00 |
Andrea Pedrotti
|
dba2ed9c9c
|
Visual Transformer VGF
|
2023-02-09 16:55:06 +01:00 |