Commit Graph

76 Commits

Author SHA1 Message Date
Andrea Pedrotti 35cc32e541 fixed inference script 2024-03-12 11:38:12 +01:00
Andrea Pedrotti 4615bc3857 dataset get_name 2023-11-06 10:52:07 +01:00
Andrea Pedrotti 1b58fed14d removed unused 2023-10-29 18:18:01 +01:00
Andrea Pedrotti 41ba20ad5c script for simpler inference 2023-10-29 18:15:01 +01:00
Andrea Pedrotti 5d07e579e4 minor updates 2023-10-05 15:58:12 +02:00
Andrea Pedrotti 875af6d362 removed unused 2023-10-05 15:46:54 +02:00
Andrea Pedrotti 22a36e5ddf removed unused 2023-10-05 15:44:16 +02:00
Andrea Pedrotti debf41d177 reqs update 2023-10-05 15:43:15 +02:00
Andrea Pedrotti 12219ffa2a update to run.sh 2023-10-05 15:41:37 +02:00
Andrea Pedrotti 234b6031b1 branching for rai 2023-10-05 15:39:49 +02:00
Andrea Pedrotti fbd740fabd bulk update: zero-shot + csvlogger + simpler dataset class + rai experiments 2023-08-03 19:31:03 +02:00
Andrea Pedrotti ae92199613 gitignore update 2023-08-03 19:30:13 +02:00
Andrea Pedrotti b6b1d33fdb set test key_prefix in test phase for wandb 2023-07-04 10:43:33 +02:00
Andrea Pedrotti 8354d76513 switched from mbert uncased to cased version 2023-07-03 19:04:26 +02:00
Andrea Pedrotti 6995854e3d hardcodednlabels f or rai datasets 2023-07-03 19:03:42 +02:00
Andrea Pedrotti 55e12505c0 removed unused cols in rai dataset 2023-07-03 19:02:37 +02:00
Andrea Pedrotti d36e185ffe update gitignore 2023-07-03 19:02:12 +02:00
Andrea Pedrotti 317fb93da6 updates 2023-06-29 11:41:37 +02:00
Andrea Pedrotti 86fbd90bd4 handling new data 2023-06-29 11:41:22 +02:00
Andrea Pedrotti 1a1c48e136 delete categories amazon 2023-06-22 11:38:40 +02:00
Andrea Pedrotti c63c35269a script to run sentiment experiemnts 2023-06-22 11:33:50 +02:00
Andrea Pedrotti 2800694672 main update 2023-06-22 11:33:29 +02:00
Andrea Pedrotti e8b6396366 gitignore update 2023-06-22 11:33:22 +02:00
Andrea Pedrotti e3e6f061d8 transformer-trainer via huggingface 2023-06-22 11:33:06 +02:00
Andrea Pedrotti 60171c1b5e avoid training transformers 2023-06-22 11:32:50 +02:00
Andrea Pedrotti 2554c58fac evaluate update 2023-06-22 11:32:27 +02:00
Andrea Pedrotti 9437ccc837 webis-cls unprocessed manager 2023-06-22 11:32:15 +02:00
Andrea Pedrotti de98926d00 todo update 2023-06-12 15:56:00 +02:00
Andrea Pedrotti bef086ab50 setting gfun config when loading pre-trained model 2023-06-12 15:55:38 +02:00
Andrea Pedrotti 732ffbefb1 minor updates 2023-06-12 12:12:53 +02:00
Andrea Pedrotti 9ce0001047 webis-unprocessed dataset 2023-06-12 12:12:31 +02:00
Andrea Pedrotti b3b7c69263 updated get_config of vgfs + restore model fn for mt5 2023-06-12 12:11:38 +02:00
Andrea Pedrotti 770e8e62be branching for sentiment 2023-06-08 10:07:00 +02:00
Andrea Pedrotti ab7a310b34 todo updates 2023-03-17 10:44:45 +01:00
Andrea Pedrotti 41647f974a last training swipe on eval set is now performed on batch size equal to the training set batch size 2023-03-17 10:44:23 +01:00
Andrea Pedrotti ee2a9481de sampling GLAMI1-M dataset 2023-03-16 18:10:05 +01:00
Andrea Pedrotti ee38bcda10 fixed TransformerGen init 2023-03-16 12:12:39 +01:00
Andrea Pedrotti b34da419d0 fixed import 2023-03-16 11:49:49 +01:00
Andrea Pedrotti 17d0003e48 getter for gFun and VGFs config 2023-03-16 11:41:40 +01:00
Andrea Pedrotti 9d43ebb23b implemented save/load for MT5ForSequenceClassification. Moved torch Datasets to datamanager module 2023-03-16 10:31:34 +01:00
Andrea Pedrotti 56faaf2615 changed wandb logging to a global level to keep track of all the VGFs and overall gFun 2023-03-15 16:35:49 +01:00
Andrea Pedrotti f32b9227ae TODO: better stratified sampling for GLAMI-1M 2023-03-15 11:48:03 +01:00
Andrea Pedrotti 65407f51fa update trainer to handle mT5 2023-03-15 11:47:17 +01:00
Andrea Pedrotti 26aa0b327a average pooling for MT5ForSequenceClassification and standardized return data 2023-03-15 11:46:53 +01:00
Andrea Pedrotti fece8d059e updated argparse 2023-03-14 11:54:40 +01:00
Andrea Pedrotti 5e41b4517a implemented MT5ForSequenceClassification 2023-03-14 11:53:50 +01:00
Andrea Pedrotti a3e183d7fc avoid duplicating model on gpu when earlystop is triggered 2023-03-14 11:22:00 +01:00
Andrea Pedrotti 57918ec523 save and load datasets as pkl 2023-03-10 12:40:26 +01:00
andreapdr 7d0d6ba1f6 log average metrics via wandb 2023-03-10 11:21:33 +01:00
andreapdr 5ef0904e0e logging average metrics 2023-03-09 17:59:18 +01:00