Commit Graph

47 Commits

Author SHA1 Message Date
Alejandro Moreo Fernandez 643a19228b data reader for lequa 2022 competition 2022-11-28 12:02:08 +01:00
Alejandro Moreo Fernandez f2550fdb82 full example of training, model selection, and evaluation using the lequa2022 dataset with the new protocols 2022-11-04 15:04:36 +01:00
Alejandro Moreo Fernandez f4a2a94ba5 fixing random_state in base and in protocols 2022-06-21 10:27:06 +02:00
Alejandro Moreo Fernandez a7c768bb40 param fix 2022-06-16 16:38:34 +02:00
Alejandro Moreo Fernandez 789b9d5fbc pathfix in lequa2022 datasets 2022-06-15 14:36:02 +02:00
Alejandro Moreo Fernandez 2cc7db60cc updating parallel policy to take n_jobs from environment (not yet tested) 2022-06-14 09:35:39 +02:00
Alejandro Moreo Fernandez 82a01478ec collator functions in protocols for preparing the outputs 2022-06-03 18:02:52 +02:00
Alejandro Moreo Fernandez bfe4b8b51a updating properties of labelled collection 2022-06-03 13:51:22 +02:00
Alejandro Moreo Fernandez 45642ad778 lequa as dataset 2022-06-01 18:28:59 +02:00
Alejandro Moreo Fernandez 4bc9d19635 many changes, see change log 2022-05-25 19:14:33 +02:00
Alejandro Moreo Fernandez ba18d00334 trying to figure out how to refactor protocols meaninguflly 2021-12-20 11:39:44 +01:00
Alejandro Moreo Fernandez 5deb92b457 update doc 2021-12-07 17:16:39 +01:00
Alejandro Moreo Fernandez 2bd47f0841 updating the documentation 2021-12-06 18:25:47 +01:00
Alejandro Moreo Fernandez 4da1233b46 adapting everything to the new file format 2021-11-30 11:36:23 +01:00
Alejandro Moreo Fernandez 7468519495 testing baselines for lequa 2021-11-24 11:20:42 +01:00
Alejandro Moreo Fernandez 9a08125e7e evaluation script and format checker added 2021-10-25 13:37:22 +02:00
Alejandro Moreo Fernandez 5f15b365fe result file format check, read, load, and evaluation with pandas 2021-10-22 19:03:15 +02:00
Alejandro Moreo Fernandez 65b2c2ce74 setting baseline experiments with data format 2021-10-21 17:14:40 +02:00
Alejandro Moreo Fernandez 4284f1daa3 branch for LeQua2022 - first commit 2021-10-13 20:36:53 +02:00
Alejandro Moreo Fernandez ce908573e7 adding features for cross-lingual 2021-07-01 18:34:24 +02:00
Alejandro Moreo Fernandez 986e61620c cleaning data base 2021-06-21 12:57:44 +02:00
Alejandro Moreo Fernandez b4aeaa97b7 fixing issue regarding fit_learner=False in QuaNetTrainer 2021-06-21 12:55:39 +02:00
Alejandro Moreo Fernandez be2f54de9c renaming functions to match the app and npp nomenclature; adding npp as an option for GridSearchQ 2021-06-16 11:45:40 +02:00
Andrea Esuli 147b2f2212 Added back n_classes to Dataset 2021-05-06 16:28:30 +02:00
Andrea Esuli 5b772c7eda Bug fixes on use of classes_. Tests. 2021-05-05 17:12:44 +02:00
Andrea Esuli 44cec7a046 Added encoding option with default to utf-8. 2021-04-30 17:00:46 +02:00
Alejandro Moreo Fernandez ca4c344c55 imports fix 2021-03-19 17:34:09 +01:00
Alejandro Moreo Fernandez 854d759dc4 making everything work like in the wiki 2021-02-17 18:05:22 +01:00
Alejandro Moreo Fernandez 98b6e2b82d readme updated 2021-02-08 19:16:43 +01:00
Alejandro Moreo Fernandez 91f8d8f3e1 readme updated 2021-02-02 12:10:57 +01:00
Alejandro Moreo Fernandez b6f80077d7 standard scaller added 2021-01-29 10:21:13 +01:00
Alejandro Moreo Fernandez 3aaf57f2f3 all uci datasets from Pérez-Gállego added, quantification report added 2021-01-28 18:22:43 +01:00
Alejandro Moreo Fernandez 1d89301089 more uci datasets, plots improved (higher fonts), and evaluation script that shows numerical results in command line 2021-01-27 22:49:54 +01:00
Alejandro Moreo Fernandez e609c262b4 parallel functionality added to quapy in order to allow for multiprocess parallelization (and not threading) handling quapy's environment variables 2021-01-27 09:54:41 +01:00
Alejandro Moreo Fernandez 301e8b9088 adding uci ml datasets 2021-01-25 18:38:56 +01:00
Alejandro Moreo Fernandez 03cf73aff6 refactor: methods requiring a val_split can now declare a default value in the __init__ method that will be used in case the fit method is called without specifying the val_split, which now is by default None in the fit, i.e., by default takes the value of the init, that is generally set to 0.4; some uci datasets added; ensembles can now be optimized for quantification, and can be trained on samples of smaller size 2021-01-22 18:01:51 +01:00
Alejandro Moreo Fernandez 5e64d2588a import fixes 2021-01-15 18:32:32 +01:00
Alejandro Moreo Fernandez c5ae2f8b1f adding table manager 2021-01-15 08:33:39 +01:00
Alejandro Moreo Fernandez 3c5a53bdec testing quapy via replicating Tweet Quantification experiments 2021-01-12 17:39:00 +01:00
Alejandro Moreo Fernandez 2ec3400d15 adding tweet sent quant experiments 2021-01-11 18:31:12 +01:00
Alejandro Moreo Fernandez 41347b50f9 cleaning and adding some uci datasets 2021-01-11 12:55:06 +01:00
Alejandro Moreo Fernandez 326a8ab803 added Ensemble methods (methods ALL, ACC, Ptr, DS from Pérez-Gallego et al 2017 and 2019) and some UCI ML datasets used in those articles (only 5 datasets out of 32 they used) 2021-01-06 14:58:29 +01:00
Alejandro Moreo Fernandez d8e2f7556e QuaNet added, two examples of TextClassifiers added (CNN, LSTM) 2020-12-29 20:33:59 +01:00
Alejandro Moreo Fernandez 7bed93dcbf added model selection for quantification 2020-12-22 17:43:23 +01:00
Alejandro Moreo Fernandez 7d6f523e4b uniform sampling added if *prevs is empty 2020-12-17 18:17:17 +01:00
Alejandro Moreo Fernandez 649d412389 dataset fetch for polarity reviews (hp, kindle, imdb) and twitter sentiment (11 datasets) added 2020-12-14 18:36:19 +01:00
Alejandro Moreo Fernandez 9bc3a9f28a evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00