Andrea Pedrotti
|
b6b1d33fdb
|
set test key_prefix in test phase for wandb
|
2023-07-04 10:43:33 +02:00 |
Andrea Pedrotti
|
8354d76513
|
switched from mbert uncased to cased version
|
2023-07-03 19:04:26 +02:00 |
Andrea Pedrotti
|
6995854e3d
|
hardcoded nlabels for rai datasets
|
2023-07-03 19:03:42 +02:00 |
Andrea Pedrotti
|
55e12505c0
|
removed unused cols in rai dataset
|
2023-07-03 19:02:37 +02:00 |
Andrea Pedrotti
|
d36e185ffe
|
update gitignore
|
2023-07-03 19:02:12 +02:00 |
Andrea Pedrotti
|
317fb93da6
|
updates
|
2023-06-29 11:41:37 +02:00 |
Andrea Pedrotti
|
86fbd90bd4
|
handling new data
|
2023-06-29 11:41:22 +02:00 |
Andrea Pedrotti
|
1a1c48e136
|
delete categories amazon
|
2023-06-22 11:38:40 +02:00 |
Andrea Pedrotti
|
c63c35269a
|
script to run sentiment experiments
|
2023-06-22 11:33:50 +02:00 |
Andrea Pedrotti
|
2800694672
|
main update
|
2023-06-22 11:33:29 +02:00 |
Andrea Pedrotti
|
e8b6396366
|
gitignore update
|
2023-06-22 11:33:22 +02:00 |
Andrea Pedrotti
|
e3e6f061d8
|
transformer-trainer via huggingface
|
2023-06-22 11:33:06 +02:00 |
Andrea Pedrotti
|
60171c1b5e
|
avoid training transformers
|
2023-06-22 11:32:50 +02:00 |
Andrea Pedrotti
|
2554c58fac
|
evaluate update
|
2023-06-22 11:32:27 +02:00 |
Andrea Pedrotti
|
9437ccc837
|
webis-cls unprocessed manager
|
2023-06-22 11:32:15 +02:00 |
Andrea Pedrotti
|
de98926d00
|
todo update
|
2023-06-12 15:56:00 +02:00 |
Andrea Pedrotti
|
bef086ab50
|
setting gfun config when loading pre-trained model
|
2023-06-12 15:55:38 +02:00 |
Andrea Pedrotti
|
732ffbefb1
|
minor updates
|
2023-06-12 12:12:53 +02:00 |
Andrea Pedrotti
|
9ce0001047
|
webis-unprocessed dataset
|
2023-06-12 12:12:31 +02:00 |
Andrea Pedrotti
|
b3b7c69263
|
updated get_config of vgfs + restore model fn for mt5
|
2023-06-12 12:11:38 +02:00 |
Andrea Pedrotti
|
770e8e62be
|
branching for sentiment
|
2023-06-08 10:07:00 +02:00 |
Andrea Pedrotti
|
ab7a310b34
|
todo updates
|
2023-03-17 10:44:45 +01:00 |
Andrea Pedrotti
|
41647f974a
|
last training sweep on eval set is now performed with a batch size equal to the training set batch size
|
2023-03-17 10:44:23 +01:00 |
Andrea Pedrotti
|
ee2a9481de
|
sampling GLAMI1-M dataset
|
2023-03-16 18:10:05 +01:00 |
Andrea Pedrotti
|
ee38bcda10
|
fixed TransformerGen init
|
2023-03-16 12:12:39 +01:00 |
Andrea Pedrotti
|
b34da419d0
|
fixed import
|
2023-03-16 11:49:49 +01:00 |
Andrea Pedrotti
|
17d0003e48
|
getter for gFun and VGFs config
|
2023-03-16 11:41:40 +01:00 |
Andrea Pedrotti
|
9d43ebb23b
|
implemented save/load for MT5ForSequenceClassification. Moved torch Datasets to datamanager module
|
2023-03-16 10:31:34 +01:00 |
Andrea Pedrotti
|
56faaf2615
|
changed wandb logging to a global level to keep track of all the VGFs and overall gFun
|
2023-03-15 16:35:49 +01:00 |
Andrea Pedrotti
|
f32b9227ae
|
TODO: better stratified sampling for GLAMI-1M
|
2023-03-15 11:48:03 +01:00 |
Andrea Pedrotti
|
65407f51fa
|
update trainer to handle mT5
|
2023-03-15 11:47:17 +01:00 |
Andrea Pedrotti
|
26aa0b327a
|
average pooling for MT5ForSequenceClassification and standardized return data
|
2023-03-15 11:46:53 +01:00 |
Andrea Pedrotti
|
fece8d059e
|
updated argparse
|
2023-03-14 11:54:40 +01:00 |
Andrea Pedrotti
|
5e41b4517a
|
implemented MT5ForSequenceClassification
|
2023-03-14 11:53:50 +01:00 |
Andrea Pedrotti
|
a3e183d7fc
|
avoid duplicating model on gpu when earlystop is triggered
|
2023-03-14 11:22:00 +01:00 |
Andrea Pedrotti
|
57918ec523
|
save and load datasets as pkl
|
2023-03-10 12:40:26 +01:00 |
andreapdr
|
7d0d6ba1f6
|
log average metrics via wandb
|
2023-03-10 11:21:33 +01:00 |
andreapdr
|
5ef0904e0e
|
logging average metrics
|
2023-03-09 17:59:18 +01:00 |
andreapdr
|
7e1ec46ebd
|
improved wandb logging
|
2023-03-09 17:03:17 +01:00 |
Andrea Pedrotti
|
3240150542
|
updated todo
|
2023-03-07 17:36:21 +01:00 |
Andrea Pedrotti
|
84dd1f093e
|
logging via wandb
|
2023-03-07 17:34:25 +01:00 |
Andrea Pedrotti
|
6b7917ca47
|
typos
|
2023-03-07 14:33:30 +01:00 |
andreapdr
|
7dead90271
|
logging via wandb
|
2023-03-07 14:20:56 +01:00 |
Andrea Pedrotti
|
f274ec7615
|
moved dataloader function get_dataset
|
2023-03-06 12:40:12 +01:00 |
Andrea Pedrotti
|
77227bbe13
|
support for binary dataset; CLS dataset; updated gitignore
|
2023-03-06 11:59:47 +01:00 |
Andrea Pedrotti
|
f9d4e50297
|
support for cls dataset; update requirements
|
2023-03-04 12:54:55 +01:00 |
Andrea Pedrotti
|
25fd67865d
|
todo update
|
2023-03-02 18:20:43 +01:00 |
Andrea Pedrotti
|
0c9454cdd4
|
implemented multimodal pipeline; gFunDataset interface; fixed imports
|
2023-03-02 18:16:46 +01:00 |
Andrea Pedrotti
|
7041f7b651
|
fixed bug: we were applying the sigmoid function twice when training the Attention-based aggregator
|
2023-02-14 14:28:17 +01:00 |
Andrea Pedrotti
|
fc98bc3924
|
gitignore update
|
2023-02-13 18:51:02 +01:00 |