Andrea Pedrotti
|
35cc32e541
|
fixed inference script
|
2024-03-12 11:38:12 +01:00 |
Andrea Pedrotti
|
4615bc3857
|
dataset get_name
|
2023-11-06 10:52:07 +01:00 |
Andrea Pedrotti
|
1b58fed14d
|
removed unused
|
2023-10-29 18:18:01 +01:00 |
Andrea Pedrotti
|
41ba20ad5c
|
script for simpler inference
|
2023-10-29 18:15:01 +01:00 |
Andrea Pedrotti
|
5d07e579e4
|
minor updates
|
2023-10-05 15:58:12 +02:00 |
Andrea Pedrotti
|
875af6d362
|
removed unused
|
2023-10-05 15:46:54 +02:00 |
Andrea Pedrotti
|
22a36e5ddf
|
removed unused
|
2023-10-05 15:44:16 +02:00 |
Andrea Pedrotti
|
debf41d177
|
reqs update
|
2023-10-05 15:43:15 +02:00 |
Andrea Pedrotti
|
12219ffa2a
|
update to run.sh
|
2023-10-05 15:41:37 +02:00 |
Andrea Pedrotti
|
234b6031b1
|
branching for rai
|
2023-10-05 15:39:49 +02:00 |
Andrea Pedrotti
|
fbd740fabd
|
bulk update: zero-shot + csvlogger + simpler dataset class + rai experiments
|
2023-08-03 19:31:03 +02:00 |
Andrea Pedrotti
|
ae92199613
|
gitignore update
|
2023-08-03 19:30:13 +02:00 |
Andrea Pedrotti
|
b6b1d33fdb
|
set test key_prefix in test phase for wandb
|
2023-07-04 10:43:33 +02:00 |
Andrea Pedrotti
|
8354d76513
|
switched from mbert uncased to cased version
|
2023-07-03 19:04:26 +02:00 |
Andrea Pedrotti
|
6995854e3d
|
hardcoded nlabels for rai datasets
|
2023-07-03 19:03:42 +02:00 |
Andrea Pedrotti
|
55e12505c0
|
removed unused cols in rai dataset
|
2023-07-03 19:02:37 +02:00 |
Andrea Pedrotti
|
d36e185ffe
|
update gitignore
|
2023-07-03 19:02:12 +02:00 |
Andrea Pedrotti
|
317fb93da6
|
updates
|
2023-06-29 11:41:37 +02:00 |
Andrea Pedrotti
|
86fbd90bd4
|
handling new data
|
2023-06-29 11:41:22 +02:00 |
Andrea Pedrotti
|
1a1c48e136
|
delete categories amazon
|
2023-06-22 11:38:40 +02:00 |
Andrea Pedrotti
|
c63c35269a
|
script to run sentiment experiments
|
2023-06-22 11:33:50 +02:00 |
Andrea Pedrotti
|
2800694672
|
main update
|
2023-06-22 11:33:29 +02:00 |
Andrea Pedrotti
|
e8b6396366
|
gitignore update
|
2023-06-22 11:33:22 +02:00 |
Andrea Pedrotti
|
e3e6f061d8
|
transformer-trainer via huggingface
|
2023-06-22 11:33:06 +02:00 |
Andrea Pedrotti
|
60171c1b5e
|
avoid training transformers
|
2023-06-22 11:32:50 +02:00 |
Andrea Pedrotti
|
2554c58fac
|
evaluate update
|
2023-06-22 11:32:27 +02:00 |
Andrea Pedrotti
|
9437ccc837
|
webis-cls unprocessed manager
|
2023-06-22 11:32:15 +02:00 |
Andrea Pedrotti
|
de98926d00
|
todo update
|
2023-06-12 15:56:00 +02:00 |
Andrea Pedrotti
|
bef086ab50
|
setting gfun config when loading pre-trained model
|
2023-06-12 15:55:38 +02:00 |
Andrea Pedrotti
|
732ffbefb1
|
minor updates
|
2023-06-12 12:12:53 +02:00 |
Andrea Pedrotti
|
9ce0001047
|
webis-unprocessed dataset
|
2023-06-12 12:12:31 +02:00 |
Andrea Pedrotti
|
b3b7c69263
|
updated get_config of vgfs + restore model fn for mt5
|
2023-06-12 12:11:38 +02:00 |
Andrea Pedrotti
|
770e8e62be
|
branching for sentiment
|
2023-06-08 10:07:00 +02:00 |
Andrea Pedrotti
|
ab7a310b34
|
todo updates
|
2023-03-17 10:44:45 +01:00 |
Andrea Pedrotti
|
41647f974a
|
last training sweep on eval set is now performed with a batch size equal to the training set batch size
|
2023-03-17 10:44:23 +01:00 |
Andrea Pedrotti
|
ee2a9481de
|
sampling GLAMI-1M dataset
|
2023-03-16 18:10:05 +01:00 |
Andrea Pedrotti
|
ee38bcda10
|
fixed TransformerGen init
|
2023-03-16 12:12:39 +01:00 |
Andrea Pedrotti
|
b34da419d0
|
fixed import
|
2023-03-16 11:49:49 +01:00 |
Andrea Pedrotti
|
17d0003e48
|
getter for gFun and VGFs config
|
2023-03-16 11:41:40 +01:00 |
Andrea Pedrotti
|
9d43ebb23b
|
implemented save/load for MT5ForSequenceClassification. Moved torch Datasets to datamanager module
|
2023-03-16 10:31:34 +01:00 |
Andrea Pedrotti
|
56faaf2615
|
changed wandb logging to a global level to keep track of all the VGFs and overall gFun
|
2023-03-15 16:35:49 +01:00 |
Andrea Pedrotti
|
f32b9227ae
|
TODO: better stratified sampling for GLAMI-1M
|
2023-03-15 11:48:03 +01:00 |
Andrea Pedrotti
|
65407f51fa
|
update trainer to handle mT5
|
2023-03-15 11:47:17 +01:00 |
Andrea Pedrotti
|
26aa0b327a
|
average pooling for MT5ForSequenceClassification and standardized return data
|
2023-03-15 11:46:53 +01:00 |
Andrea Pedrotti
|
fece8d059e
|
updated argparse
|
2023-03-14 11:54:40 +01:00 |
Andrea Pedrotti
|
5e41b4517a
|
implemented MT5ForSequenceClassification
|
2023-03-14 11:53:50 +01:00 |
Andrea Pedrotti
|
a3e183d7fc
|
avoid duplicating model on gpu when earlystop is triggered
|
2023-03-14 11:22:00 +01:00 |
Andrea Pedrotti
|
57918ec523
|
save and load datasets as pkl
|
2023-03-10 12:40:26 +01:00 |
andreapdr
|
7d0d6ba1f6
|
log average metrics via wandb
|
2023-03-10 11:21:33 +01:00 |
andreapdr
|
5ef0904e0e
|
logging average metrics
|
2023-03-09 17:59:18 +01:00 |