43 Commits

Author SHA1 Message Date
Andrea Pedrotti ab7a310b34 todo updates 2023-03-17 10:44:45 +01:00
Andrea Pedrotti 41647f974a last training swipe on eval set is now performed on batch size equal to the training set batch size 2023-03-17 10:44:23 +01:00
Andrea Pedrotti ee2a9481de sampling GLAMI1-M dataset 2023-03-16 18:10:05 +01:00
Andrea Pedrotti ee38bcda10 fixed TransformerGen init 2023-03-16 12:12:39 +01:00
Andrea Pedrotti b34da419d0 fixed import 2023-03-16 11:49:49 +01:00
Andrea Pedrotti 17d0003e48 getter for gFun and VGFs config 2023-03-16 11:41:40 +01:00
Andrea Pedrotti 9d43ebb23b implemented save/load for MT5ForSequenceClassification. Moved torch Datasets to datamanager module 2023-03-16 10:31:34 +01:00
Andrea Pedrotti 56faaf2615 changed wandb logging to a global level to keep track of all the VGFs and overall gFun 2023-03-15 16:35:49 +01:00
Andrea Pedrotti f32b9227ae TODO: better stratified sampling for GLAMI-1M 2023-03-15 11:48:03 +01:00
Andrea Pedrotti 65407f51fa update trainer to handle mT5 2023-03-15 11:47:17 +01:00
Andrea Pedrotti 26aa0b327a average pooling for MT5ForSequenceClassification and standardized return data 2023-03-15 11:46:53 +01:00
Andrea Pedrotti fece8d059e updated argparse 2023-03-14 11:54:40 +01:00
Andrea Pedrotti 5e41b4517a implemented MT5ForSequenceClassification 2023-03-14 11:53:50 +01:00
Andrea Pedrotti a3e183d7fc avoid duplicating model on gpu when earlystop is triggered 2023-03-14 11:22:00 +01:00
Andrea Pedrotti 57918ec523 save and load datasets as pkl 2023-03-10 12:40:26 +01:00
andreapdr 7d0d6ba1f6 log average metrics via wandb 2023-03-10 11:21:33 +01:00
andreapdr 5ef0904e0e logging average metrics 2023-03-09 17:59:18 +01:00
andreapdr 7e1ec46ebd improved wandb logging 2023-03-09 17:03:17 +01:00
Andrea Pedrotti 3240150542 updated todo 2023-03-07 17:36:21 +01:00
Andrea Pedrotti 84dd1f093e logging via wandb 2023-03-07 17:34:25 +01:00
Andrea Pedrotti 6b7917ca47 typos 2023-03-07 14:33:30 +01:00
andreapdr 7dead90271 logging via wandb 2023-03-07 14:20:56 +01:00
Andrea Pedrotti f274ec7615 moved dataloader function get_dataset 2023-03-06 12:40:12 +01:00
Andrea Pedrotti 77227bbe13 support for binary dataset; CLS dataset; updated gitignore 2023-03-06 11:59:47 +01:00
Andrea Pedrotti f9d4e50297 support for cls dataset; update requirements 2023-03-04 12:54:55 +01:00
Andrea Pedrotti 25fd67865d todo update 2023-03-02 18:20:43 +01:00
Andrea Pedrotti 0c9454cdd4 implemented multimodal pipeline; gFunDataset interface; fixed imports 2023-03-02 18:16:46 +01:00
Andrea Pedrotti 7041f7b651 fixed bug: we were applying sigmoid function 2 times when training the Attention-based aggregator 2023-02-14 14:28:17 +01:00
Andrea Pedrotti fc98bc3924 gitignore update 2023-02-13 18:51:02 +01:00
Andrea Pedrotti 298f31669d Data Classes for GLAMI-1M Dataset 2023-02-13 18:29:54 +01:00
Andrea Pedrotti 7ed98346a5 fixed loading function for Attention-based aggregating function when triggered by EarlyStopper 2023-02-13 15:01:50 +01:00
Andrea Pedrotti 930a6d8275 generated requirements file 2023-02-13 15:01:00 +01:00
Andrea Pedrotti 13ada46c34 attention-based aggregation function, first implementation, some hard-coded parameters 2023-02-10 18:29:58 +01:00
Andrea Pedrotti 2a42b21ac9 concat aggfunc 2023-02-10 12:58:26 +01:00
Andrea Pedrotti 3f3e4982e4 model checkpoint during training. Restore best model if earlystop is triggered 2023-02-10 11:37:32 +01:00
Andrea Pedrotti 9c2c43dafb Visual VGF + MultiNewsDataset, working from data loading to testing 2023-02-09 18:42:27 +01:00
Andrea Pedrotti 1a3f931c70 Merge pull request 'devel' (#1) from devel into master (Reviewed-on: #1) 2023-02-09 16:56:59 +01:00
Andrea Pedrotti dba2ed9c9c Visual Transformer VGF 2023-02-09 16:55:06 +01:00
Andrea Pedrotti 4485d97e03 test commit 2023-02-09 16:47:17 +01:00
Andrea Pedrotti 8325262972 MultiNewsDataset download/save image fn + class for Visual View Generating Function 2023-02-08 18:11:53 +01:00
Andrea Pedrotti 19e4f294db better way to save/load model via id ({config}_{date}); Implemented __str__ for each VGFs + get_config in GeneralizedFunnelling 2023-02-08 16:06:24 +01:00
Andrea Pedrotti 31fb436cf0 implemented fn to save/load trained gfun 2023-02-08 14:51:56 +01:00
Andrea Pedrotti 6b75483b55 bulk upload after refactoring 2023-02-07 18:40:17 +01:00