Commit Graph

56 Commits

Author SHA1 Message Date
Andrea Pedrotti c63c35269a script to run sentiment experiemnts 2023-06-22 11:33:50 +02:00
Andrea Pedrotti 2800694672 main update 2023-06-22 11:33:29 +02:00
Andrea Pedrotti e8b6396366 gitignore update 2023-06-22 11:33:22 +02:00
Andrea Pedrotti e3e6f061d8 transformer-trainer via huggingface 2023-06-22 11:33:06 +02:00
Andrea Pedrotti 60171c1b5e avoid training transformers 2023-06-22 11:32:50 +02:00
Andrea Pedrotti 2554c58fac evaluate update 2023-06-22 11:32:27 +02:00
Andrea Pedrotti 9437ccc837 webis-cls unprocessed manager 2023-06-22 11:32:15 +02:00
Andrea Pedrotti de98926d00 todo update 2023-06-12 15:56:00 +02:00
Andrea Pedrotti bef086ab50 setting gfun config when loading pre-trained model 2023-06-12 15:55:38 +02:00
Andrea Pedrotti 732ffbefb1 minor updates 2023-06-12 12:12:53 +02:00
Andrea Pedrotti 9ce0001047 webis-unprocessed dataset 2023-06-12 12:12:31 +02:00
Andrea Pedrotti b3b7c69263 updated get_config of vgfs + restore model fn for mt5 2023-06-12 12:11:38 +02:00
Andrea Pedrotti 770e8e62be branching for sentiment 2023-06-08 10:07:00 +02:00
Andrea Pedrotti ab7a310b34 todo updates 2023-03-17 10:44:45 +01:00
Andrea Pedrotti 41647f974a last training swipe on eval set is now performed on batch size equal to the training set batch size 2023-03-17 10:44:23 +01:00
Andrea Pedrotti ee2a9481de sampling GLAMI1-M dataset 2023-03-16 18:10:05 +01:00
Andrea Pedrotti ee38bcda10 fixed TransformerGen init 2023-03-16 12:12:39 +01:00
Andrea Pedrotti b34da419d0 fixed import 2023-03-16 11:49:49 +01:00
Andrea Pedrotti 17d0003e48 getter for gFun and VGFs config 2023-03-16 11:41:40 +01:00
Andrea Pedrotti 9d43ebb23b implemented save/load for MT5ForSequenceClassification. Moved torch Datasets to datamanager module 2023-03-16 10:31:34 +01:00
Andrea Pedrotti 56faaf2615 changed wandb logging to a global level to keep track of all the VGFs and overall gFun 2023-03-15 16:35:49 +01:00
Andrea Pedrotti f32b9227ae TODO: better stratified sampling for GLAMI-1M 2023-03-15 11:48:03 +01:00
Andrea Pedrotti 65407f51fa update trainer to handle mT5 2023-03-15 11:47:17 +01:00
Andrea Pedrotti 26aa0b327a average pooling for MT5ForSequenceClassification and standardized return data 2023-03-15 11:46:53 +01:00
Andrea Pedrotti fece8d059e updated argparse 2023-03-14 11:54:40 +01:00
Andrea Pedrotti 5e41b4517a implemented MT5ForSequenceClassification 2023-03-14 11:53:50 +01:00
Andrea Pedrotti a3e183d7fc avoid duplicating model on gpu when earlystop is triggered 2023-03-14 11:22:00 +01:00
Andrea Pedrotti 57918ec523 save and load datasets as pkl 2023-03-10 12:40:26 +01:00
andreapdr 7d0d6ba1f6 log average metrics via wandb 2023-03-10 11:21:33 +01:00
andreapdr 5ef0904e0e logging average metrics 2023-03-09 17:59:18 +01:00
andreapdr 7e1ec46ebd improved wandb logging 2023-03-09 17:03:17 +01:00
Andrea Pedrotti 3240150542 updated todo 2023-03-07 17:36:21 +01:00
Andrea Pedrotti 84dd1f093e logging via wandb 2023-03-07 17:34:25 +01:00
Andrea Pedrotti 6b7917ca47 typos 2023-03-07 14:33:30 +01:00
andreapdr 7dead90271 logging via wandb 2023-03-07 14:20:56 +01:00
Andrea Pedrotti f274ec7615 moved dataloader function get_dataset 2023-03-06 12:40:12 +01:00
Andrea Pedrotti 77227bbe13 support for binary dataset; CLS dataset; updated gitignore 2023-03-06 11:59:47 +01:00
Andrea Pedrotti f9d4e50297 support for cls dataset; update requirements 2023-03-04 12:54:55 +01:00
Andrea Pedrotti 25fd67865d todo update 2023-03-02 18:20:43 +01:00
Andrea Pedrotti 0c9454cdd4 implemented multimodal pipeline; gFunDataset interface; fixed imports 2023-03-02 18:16:46 +01:00
Andrea Pedrotti 7041f7b651 fixed bug: we were applying sigmoid function 2 times when training the Attention-based aggregator 2023-02-14 14:28:17 +01:00
Andrea Pedrotti fc98bc3924 gitignore update 2023-02-13 18:51:02 +01:00
Andrea Pedrotti 298f31669d Data Classes for GLAMI-1M Dataset 2023-02-13 18:29:54 +01:00
Andrea Pedrotti 7ed98346a5 fixed loading function for Attention-based aggregating function when triggered by EarlyStopper 2023-02-13 15:01:50 +01:00
Andrea Pedrotti 930a6d8275 generated requirements file 2023-02-13 15:01:00 +01:00
Andrea Pedrotti 13ada46c34 attention-based aggregation function, first implementation, some hard-coded parameters 2023-02-10 18:29:58 +01:00
Andrea Pedrotti 2a42b21ac9 concat aggfunc 2023-02-10 12:58:26 +01:00
Andrea Pedrotti 3f3e4982e4 model checkpoint during training. Restore best model if earlystop is triggered 2023-02-10 11:37:32 +01:00
Andrea Pedrotti 9c2c43dafb Visual VGF + MultiNewsDataset, working from data loading to testing 2023-02-09 18:42:27 +01:00
Andrea Pedrotti 1a3f931c70 Merge pull request 'devel' (#1) from devel into master
Reviewed-on: #1
2023-02-09 16:56:59 +01:00