gfun_multimodal

Commit Graph

Author	SHA1	Message	Date
Andrea Pedrotti	c63c35269a	script to run sentiment experiments	2023-06-22 11:33:50 +02:00
Andrea Pedrotti	2800694672	main update	2023-06-22 11:33:29 +02:00
Andrea Pedrotti	e8b6396366	gitignore update	2023-06-22 11:33:22 +02:00
Andrea Pedrotti	e3e6f061d8	transformer-trainer via huggingface	2023-06-22 11:33:06 +02:00
Andrea Pedrotti	60171c1b5e	avoid training transformers	2023-06-22 11:32:50 +02:00
Andrea Pedrotti	2554c58fac	evaluate update	2023-06-22 11:32:27 +02:00
Andrea Pedrotti	9437ccc837	webis-cls unprocessed manager	2023-06-22 11:32:15 +02:00
Andrea Pedrotti	de98926d00	todo update	2023-06-12 15:56:00 +02:00
Andrea Pedrotti	bef086ab50	setting gfun config when loading pre-trained model	2023-06-12 15:55:38 +02:00
Andrea Pedrotti	732ffbefb1	minor updates	2023-06-12 12:12:53 +02:00
Andrea Pedrotti	9ce0001047	webis-unprocessed dataset	2023-06-12 12:12:31 +02:00
Andrea Pedrotti	b3b7c69263	updated get_config of vgfs + restore model fn for mt5	2023-06-12 12:11:38 +02:00
Andrea Pedrotti	770e8e62be	branching for sentiment	2023-06-08 10:07:00 +02:00
Andrea Pedrotti	ab7a310b34	todo updates	2023-03-17 10:44:45 +01:00
Andrea Pedrotti	41647f974a	last training sweep on eval set is now performed with a batch size equal to the training set batch size	2023-03-17 10:44:23 +01:00
Andrea Pedrotti	ee2a9481de	sampling GLAMI1-M dataset	2023-03-16 18:10:05 +01:00
Andrea Pedrotti	ee38bcda10	fixed TransformerGen init	2023-03-16 12:12:39 +01:00
Andrea Pedrotti	b34da419d0	fixed import	2023-03-16 11:49:49 +01:00
Andrea Pedrotti	17d0003e48	getter for gFun and VGFs config	2023-03-16 11:41:40 +01:00
Andrea Pedrotti	9d43ebb23b	implemented save/load for MT5ForSequenceClassification. Moved torch Datasets to datamanager module	2023-03-16 10:31:34 +01:00
Andrea Pedrotti	56faaf2615	changed wandb logging to a global level to keep track of all the VGFs and overall gFun	2023-03-15 16:35:49 +01:00
Andrea Pedrotti	f32b9227ae	TODO: better stratified sampling for GLAMI-1M	2023-03-15 11:48:03 +01:00
Andrea Pedrotti	65407f51fa	update trainer to handle mT5	2023-03-15 11:47:17 +01:00
Andrea Pedrotti	26aa0b327a	average pooling for MT5ForSequenceClassification and standardized return data	2023-03-15 11:46:53 +01:00
Andrea Pedrotti	fece8d059e	updated argparse	2023-03-14 11:54:40 +01:00
Andrea Pedrotti	5e41b4517a	implemented MT5ForSequenceClassification	2023-03-14 11:53:50 +01:00
Andrea Pedrotti	a3e183d7fc	avoid duplicating model on gpu when earlystop is triggered	2023-03-14 11:22:00 +01:00
Andrea Pedrotti	57918ec523	save and load datasets as pkl	2023-03-10 12:40:26 +01:00
andreapdr	7d0d6ba1f6	log average metrics via wandb	2023-03-10 11:21:33 +01:00
andreapdr	5ef0904e0e	logging average metrics	2023-03-09 17:59:18 +01:00
andreapdr	7e1ec46ebd	improved wandb logging	2023-03-09 17:03:17 +01:00
Andrea Pedrotti	3240150542	updated todo	2023-03-07 17:36:21 +01:00
Andrea Pedrotti	84dd1f093e	logging via wandb	2023-03-07 17:34:25 +01:00
Andrea Pedrotti	6b7917ca47	typos	2023-03-07 14:33:30 +01:00
andreapdr	7dead90271	logging via wandb	2023-03-07 14:20:56 +01:00
Andrea Pedrotti	f274ec7615	moved dataloader function get_dataset	2023-03-06 12:40:12 +01:00
Andrea Pedrotti	77227bbe13	support for binary dataset; CLS dataset; updated gitignore	2023-03-06 11:59:47 +01:00
Andrea Pedrotti	f9d4e50297	support for cls dataset; update requirements	2023-03-04 12:54:55 +01:00
Andrea Pedrotti	25fd67865d	todo update	2023-03-02 18:20:43 +01:00
Andrea Pedrotti	0c9454cdd4	implemented multimodal pipeline; gFunDataset interface; fixed imports	2023-03-02 18:16:46 +01:00
Andrea Pedrotti	7041f7b651	fixed bug: we were applying sigmoid function 2 times when training the Attention-based aggregator	2023-02-14 14:28:17 +01:00
Andrea Pedrotti	fc98bc3924	gitignore update	2023-02-13 18:51:02 +01:00
Andrea Pedrotti	298f31669d	Data Classes for GLAMI-1M Dataset	2023-02-13 18:29:54 +01:00
Andrea Pedrotti	7ed98346a5	fixed loading function for Attention-based aggregating function when triggered by EarlyStopper	2023-02-13 15:01:50 +01:00
Andrea Pedrotti	930a6d8275	generated requirements file	2023-02-13 15:01:00 +01:00
Andrea Pedrotti	13ada46c34	attention-based aggregation function, first implementation, some hard-coded parameters	2023-02-10 18:29:58 +01:00
Andrea Pedrotti	2a42b21ac9	concat aggfunc	2023-02-10 12:58:26 +01:00
Andrea Pedrotti	3f3e4982e4	model checkpoint during training. Restore best model if earlystop is triggered	2023-02-10 11:37:32 +01:00
Andrea Pedrotti	9c2c43dafb	Visual VGF + MultiNewsDataset, working from data loading to testing	2023-02-09 18:42:27 +01:00
Andrea Pedrotti	1a3f931c70	Merge pull request 'devel' (#1) from devel into master. Reviewed-on: #1	2023-02-09 16:56:59 +01:00

56 Commits

All Branches