Commit Graph

248 Commits

Author SHA1 Message Date
Alejandro Moreo Fernandez a4c33a8e4d import fix 2022-10-04 17:44:16 +02:00
Alejandro Moreo Fernandez e40c409609 bugfix in NeuralClassifierTrainer; it was only configured to work well in binary problems 2022-10-04 11:03:08 +02:00
Alejandro Moreo Fernandez 3af7c70a53 restoring the default legend in diag plot 2022-10-04 09:12:51 +02:00
Alejandro Moreo Fernandez 543003f914 Merge pull request #13 from pglez82/dys_implementation (Dys implementation) 2022-07-12 13:05:35 +02:00
Pablo González a4584b79db changing gridsearchQ to ensure reproducibility 2022-07-11 16:27:02 +02:00
Pablo González c91961cff5 adding to __init__.py 2022-07-11 14:10:04 +02:00
Pablo González 428f10fb2d adding SMM 2022-07-11 14:06:14 +02:00
Alejandro Moreo Fernandez ecd0ad7ec7 unit test for replicability based on qp.util.temp_seed 2022-07-11 14:00:25 +02:00
Pablo González 46e294002f dys implementation 2022-07-11 12:21:49 +02:00
Pablo González 750814ef2a fixing bug in ACC when using cross validation 2022-06-24 14:20:08 +02:00
Pablo González 02dd2846ff changing app to use prevalence_linspace function with smooth limits 2022-06-24 14:05:47 +02:00
Pablo González cf7d37c793 removing log message 2022-06-21 11:07:00 +02:00
Alejandro Moreo Fernandez 8f6aa629b8 param seed changed to random_state 2022-06-21 10:49:30 +02:00
Alejandro Moreo Fernandez cef20d8b32 Merge branch 'protocols' of github.com:HLT-ISTI/QuaPy into protocols 2022-06-21 10:27:12 +02:00
Alejandro Moreo Fernandez f4a2a94ba5 fixing random_state in base and in protocols 2022-06-21 10:27:06 +02:00
Alejandro Moreo Fernandez cf0bd14cf1 bug fix in covariate shift protocol 2022-06-17 12:51:52 +02:00
Alejandro Moreo Fernandez c0c37f0a17 return type in covariate protocol 2022-06-16 16:54:15 +02:00
Alejandro Moreo Fernandez a7c768bb40 param fix 2022-06-16 16:38:34 +02:00
Alejandro Moreo Fernandez c795404e7f import fix 2022-06-15 16:54:42 +02:00
Alejandro Moreo Fernandez 789b9d5fbc pathfix in lequa2022 datasets 2022-06-15 14:36:02 +02:00
Alejandro Moreo Fernandez 2cc7db60cc updating parallel policy to take n_jobs from environment (not yet tested) 2022-06-14 09:35:39 +02:00
Alejandro Moreo Fernandez 82a01478ec collator functions in protocols for preparing the outputs 2022-06-03 18:02:52 +02:00
Alejandro Moreo Fernandez bfe4b8b51a updating properties of labelled collection 2022-06-03 13:51:22 +02:00
Alejandro Moreo Fernandez 45642ad778 lequa as dataset 2022-06-01 18:28:59 +02:00
Alejandro Moreo Fernandez eba6fd8123 optimization conditional in the prediction function 2022-05-26 17:59:23 +02:00
Alejandro Moreo Fernandez 4bc9d19635 many changes, see change log 2022-05-25 19:14:33 +02:00
Alejandro Moreo Fernandez 46e3632200 ongoing protocols 2022-05-23 00:20:08 +02:00
Alejandro Moreo Fernandez b453c8fcbc first commit protocols 2022-05-20 16:48:46 +02:00
Alejandro Moreo Fernandez cbe3f410ed updating diagonal plot legend 2022-05-20 11:52:59 +02:00
Alejandro Moreo Fernandez 6a5c528154 Merge branch 'master' of github.com:HLT-ISTI/QuaPy 2022-05-19 13:43:57 +02:00
Alejandro Moreo Fernandez fd339839a5 removing redundant code 2022-05-19 13:43:32 +02:00
Alejandro Moreo Fernandez 9f4a9cb3fd Merge branch 'master' of github.com:HLT-ISTI/QuaPy 2022-04-12 17:23:39 +02:00
Alejandro Moreo Fernandez 524ec37f83 sample_size can now be set to None to indicate that the value has to be resolved by inspecting the environment variable SAMPLE_SIZE 2022-04-12 17:13:38 +02:00
Alejandro Moreo Fernandez fa577abdd2 merging from pull request and adding documentation 2022-03-15 14:16:37 +01:00
Alejandro Moreo Fernandez de9d5aaf5b Merge branch 'master' of github.com:HLT-ISTI/QuaPy 2022-03-14 16:43:03 +01:00
Alejandro Moreo Fernandez 8ee5e499f5 bugfix when the number of positive elements for one of the classes is 0 2022-03-14 16:42:41 +01:00
Pablo González 2fde7921d4 updating comments (I think these comments are not correct. Changing them) 2022-02-11 14:33:00 +01:00
Alejandro Moreo Fernandez ba18d00334 trying to figure out how to refactor protocols meaningfully 2021-12-20 11:39:44 +01:00
Alejandro Moreo Fernandez 164f7d8d5c documenting quanet 2021-12-15 16:39:57 +01:00
Alejandro Moreo Fernandez 9cf9c73824 adding documentation for ensembles 2021-12-15 15:46:15 +01:00
Alejandro Moreo Fernandez 3835f89e9d adding documentation 2021-12-15 15:27:43 +01:00
Alejandro Moreo Fernandez 5deb92b457 update doc 2021-12-07 17:16:39 +01:00
Alejandro Moreo Fernandez 2bd47f0841 updating the documentation 2021-12-06 18:25:47 +01:00
Alejandro Moreo Fernandez 4da1233b46 adapting everything to the new file format 2021-11-30 11:36:23 +01:00
Alejandro Moreo Fernandez 8368c467dc adapting new format 2021-11-26 10:57:49 +01:00
Alejandro Moreo Fernandez 7468519495 testing baselines for lequa 2021-11-24 11:20:42 +01:00
Alejandro Moreo Fernandez 1a3755eb58 adding documentation, adding brokenbar plots, merging plots from tweetsent with density 2021-11-22 18:10:48 +01:00
Alejandro Moreo Fernandez b78c8268fd update qp.error documentation 2021-11-12 15:37:31 +01:00
Alejandro Moreo Fernandez 3eb760901f doc update, official baselines for T1A and T1B refactored 2021-11-12 14:30:02 +01:00
Alejandro Moreo Fernandez 611d080ca6 format fix 2021-11-09 15:44:57 +01:00
Alejandro Moreo Fernandez 238a30520c adapting everything to the new format 2021-11-08 18:01:49 +01:00
Alejandro Moreo Fernandez a7e87e41f8 GridSearchQ adapted to work with generator functions and integrated for the baselines of LeQua2022; some tests with SVD 2021-10-26 18:41:10 +02:00
Alejandro Moreo Fernandez 9a08125e7e evaluation script and format checker added 2021-10-25 13:37:22 +02:00
Alejandro Moreo Fernandez 5f15b365fe result file format check, read, load, and evaluation with pandas 2021-10-22 19:03:15 +02:00
Alejandro Moreo Fernandez 65b2c2ce74 setting baseline experiments with data format 2021-10-21 17:14:40 +02:00
Alejandro Moreo Fernandez 4284f1daa3 branch for LeQua2022 - first commit 2021-10-13 20:36:53 +02:00
Alejandro Moreo Fernandez 537a95fa18 Merge branch 'crosslingual' of gitea-s2i2s.isti.cnr.it:moreo/QuaPy 2021-07-02 13:51:52 +02:00
Alejandro Moreo Fernandez a091b2af82 update to 0.1.6 2021-07-02 11:09:43 +02:00
Alejandro Moreo Fernandez f9b80ae437 bugfix in quanet 2021-07-02 10:20:42 +02:00
Alejandro Moreo Fernandez ce908573e7 adding features for cross-lingual 2021-07-01 18:34:24 +02:00
Alejandro Moreo Fernandez 986e61620c cleaning data base 2021-06-21 12:57:44 +02:00
Alejandro Moreo Fernandez b4aeaa97b7 fixing issue regarding fit_learner=False in QuaNetTrainer 2021-06-21 12:55:39 +02:00
Alejandro Moreo Fernandez f33abb5319 Merge branch 'master' of https://github.com/HLT-ISTI/QuaPy 2021-06-21 11:13:38 +02:00
Alejandro Moreo Fernandez a1cdc9ef43 fixing fit_learner=False case in QuaNet 2021-06-21 11:13:14 +02:00
Alejandro Moreo Fernandez 8cc11c0276 Merge pull request #6 from pglez82/master (fixing two problems with parameters: hidden_size and lstm_nlayers) 2021-06-21 10:32:58 +02:00
Alejandro Moreo Fernandez 8239947746 refit=True default value in GridSearchQ 2021-06-16 13:53:54 +02:00
Alejandro Moreo Fernandez 294e251450 bugfix when refit=False in model selection 2021-06-16 13:23:44 +02:00
Alejandro Moreo Fernandez 4f07680381 adding Forman's methods 2021-06-16 12:03:37 +02:00
Alejandro Moreo Fernandez be2f54de9c renaming functions to match the app and npp nomenclature; adding npp as an option for GridSearchQ 2021-06-16 11:45:40 +02:00
Alejandro Moreo Fernandez f28a84242f adding more colors to plots 2021-06-16 09:38:13 +02:00
Alejandro Moreo Fernandez 491ac56877 adding more colors to plots 2021-06-15 10:10:19 +02:00
Alejandro Moreo Fernandez e8c3e29911 svm-perf leaks model, fixing... 2021-06-15 07:49:16 +02:00
Alejandro Moreo Fernandez 9fd9d096f6 Merge branch 'master' of https://github.com/HLT-ISTI/QuaPy 2021-06-11 11:00:37 +02:00
Alejandro Moreo Fernandez 2e842a1d1f change in pcalr 2021-06-11 10:59:35 +02:00
Alejandro Moreo Fernandez 2586e6ed7d fix in PCALR 2021-06-11 10:52:30 +02:00
Pablo Gonzalez cb40b71a38 fixing two problems with parameters: hidden_size and lstm_nlayers 2021-06-07 12:22:06 +02:00
Andrea Esuli 9c880c3d24 version update 2021-06-01 16:15:31 +02:00
Alejandro Moreo Fernandez 460efe7105 OneVsAll does not have attribute learner_ solved 2021-06-01 16:07:01 +02:00
Alejandro Moreo Fernandez be1fa757d6 cleaning 2021-05-27 16:56:09 +02:00
Alejandro Moreo Fernandez bef16fe518 Merge branch 'master' of github.com:HLT-ISTI/QuaPy 2021-05-27 16:54:11 +02:00
Alejandro Moreo Fernandez 731b54c5ba adding natural sampling protocol 2021-05-27 16:53:58 +02:00
Andrea Esuli 79fbbd9d80 pip package 2021-05-10 13:36:35 +02:00
Andrea Esuli 32b25146c1 Tests 2021-05-10 10:26:51 +02:00
Andrea Esuli 147b2f2212 Added back n_classes to Dataset 2021-05-06 16:28:30 +02:00
Andrea Esuli 5b772c7eda Bug fixes on use of classes_. Tests. 2021-05-05 17:12:44 +02:00
Andrea Esuli bfbfe08116 Added classes_ property to all quantifiers. 2021-05-04 17:09:13 +02:00
Andrea Esuli 70a3d4bd0f Tests for non aggregative and meta methods. 2021-05-04 12:14:14 +02:00
Andrea Esuli 8f284e540a Tests 2021-04-30 17:22:58 +02:00
Andrea Esuli 44cec7a046 Added encoding option with default to utf-8. 2021-04-30 17:00:46 +02:00
Andrea Esuli d86c402916 Added first tests 2021-04-29 16:07:39 +02:00
Alejandro Moreo Fernandez 1d12e96867 cleaning 2021-04-28 11:27:25 +02:00
Alejandro Moreo Fernandez 252e143ef6 fine grained sld updates... 2021-04-28 10:04:21 +02:00
Alejandro Moreo Fernandez f3b505eb4e Merge branch 'master' of gitea-s2i2s.isti.cnr.it:moreo/QuaPy 2021-04-27 11:27:52 +02:00
Alejandro Moreo Fernandez ca4c344c55 imports fix 2021-03-19 17:34:09 +01:00
Alejandro Moreo Fernandez 8381bce3a8 more fgsld 2021-03-11 19:00:40 +01:00
Alejandro Moreo Fernandez 3491ed7d72 plot fix 2021-03-11 09:27:14 +01:00
Alejandro Moreo Fernandez 775417c8eb bugfix in PACC 2021-02-18 13:48:41 +01:00
Alejandro Moreo Fernandez 854d759dc4 making everything work like in the wiki 2021-02-17 18:05:22 +01:00
Alejandro Moreo Fernandez 70da8f7925 updating the documentation 2021-02-16 19:38:52 +01:00
Alejandro Moreo Fernandez a2ec72496a adding eval_budget to evaluation functions 2021-02-09 11:48:16 +01:00
Alejandro Moreo Fernandez 98b6e2b82d readme updated 2021-02-08 19:16:43 +01:00
Alejandro Moreo Fernandez 91f8d8f3e1 readme updated 2021-02-02 12:10:57 +01:00
Alejandro Moreo Fernandez b6f80077d7 standard scaler added 2021-01-29 10:21:13 +01:00
Alejandro Moreo Fernandez 3aaf57f2f3 all uci datasets from Pérez-Gállego added, quantification report added 2021-01-28 18:22:43 +01:00
Alejandro Moreo Fernandez 1d89301089 more uci datasets, plots improved (larger fonts), and evaluation script that shows numerical results in command line 2021-01-27 22:49:54 +01:00
Alejandro Moreo Fernandez e609c262b4 parallel functionality added to quapy in order to allow for multiprocess parallelization (and not threading) handling quapy's environment variables 2021-01-27 09:54:41 +01:00
Alejandro Moreo Fernandez 301e8b9088 adding uci ml datasets 2021-01-25 18:38:56 +01:00
Alejandro Moreo Fernandez 8a53cfe677 parallel in threading mode (another) 2021-01-25 16:46:59 +01:00
Alejandro Moreo Fernandez 2fda46fc13 qp.environment was not reachable within Parallel processes; changing backend to threading 2021-01-25 09:58:39 +01:00
Alejandro Moreo Fernandez e7527bd7ed bugfix 2021-01-25 09:02:11 +01:00
Alejandro Moreo Fernandez 2001c6d852 running ensembles 2021-01-25 08:54:10 +01:00
Alejandro Moreo Fernandez 03cf73aff6 refactor: methods requiring a val_split can now declare a default value in the __init__ method that will be used in case the fit method is called without specifying the val_split, which now is by default None in the fit, i.e., by default takes the value of the init, that is generally set to 0.4; some uci datasets added; ensembles can now be optimized for quantification, and can be trained on samples of smaller size 2021-01-22 18:01:51 +01:00
Alejandro Moreo Fernandez 54dc2980e6 creating checkpointdir in fit 2021-01-22 10:03:08 +01:00
Alejandro Moreo Fernandez bf1cc74ba1 quapy fixed 2021-01-22 09:58:12 +01:00
Alejandro Moreo Fernandez 1ba0748b59 experimental method ave-pool, not working due to the fact that onevsall is aggregative and ave-pool is not 2021-01-20 17:03:12 +01:00
Alejandro Moreo Fernandez 99132c8166 fixing quanet 2021-01-20 12:35:14 +01:00
Alejandro Moreo Fernandez f69eb59eb8 launching quanet 2021-01-20 09:01:04 +01:00
Alejandro Moreo Fernandez 482e4453a8 refactor of ensembles, launching EPACC with Ptr policy 2021-01-19 18:26:40 +01:00
Alejandro Moreo Fernandez 1399125fb8 adding QuaNet to experiments of Twitter; trying new stuff in 'NewMethods' 2021-01-18 19:14:04 +01:00
Alejandro Moreo Fernandez b30c40b7a0 some refactor made in order to accommodate OneVsAll to operate with aggregative probabilistic quantifiers; launching OneVsAll(HDy) 2021-01-18 16:52:19 +01:00
Alejandro Moreo Fernandez 8ef9e6a633 bugfix 2021-01-18 10:53:22 +01:00
Alejandro Moreo Fernandez 5e64d2588a import fixes 2021-01-15 18:32:32 +01:00
Alejandro Moreo Fernandez 9c7c017acd import fix 2021-01-15 17:47:55 +01:00
Alejandro Moreo Fernandez 27c688eb2e import fix 2021-01-15 17:44:28 +01:00
Alejandro Moreo Fernandez 865dafaefc setting a timeout for model_selection combinations in order to prevent some combinations from stalling the model selection 2021-01-15 17:42:19 +01:00
Alejandro Moreo Fernandez c5ae2f8b1f adding table manager 2021-01-15 08:33:39 +01:00
Alejandro Moreo Fernandez 3c5a53bdec testing quapy via replicating Tweet Quantification experiments 2021-01-12 17:39:00 +01:00
Alejandro Moreo Fernandez 3e07feda3c import bug fixed 2021-01-12 09:35:49 +01:00
Alejandro Moreo Fernandez 2ec3400d15 adding tweet sent quant experiments 2021-01-11 18:31:12 +01:00
Alejandro Moreo Fernandez 41347b50f9 cleaning and adding some uci datasets 2021-01-11 12:55:06 +01:00
Alejandro Moreo Fernandez d1b449d2e9 plot functionality added 2021-01-07 17:58:48 +01:00
Alejandro Moreo Fernandez 5894d46b31 cleaning 2021-01-06 15:00:48 +01:00
Alejandro Moreo Fernandez 326a8ab803 added Ensemble methods (methods ALL, ACC, Ptr, DS from Pérez-Gállego et al 2017 and 2019) and some UCI ML datasets used in those articles (only 5 datasets out of 32 they used) 2021-01-06 14:58:29 +01:00
Alejandro Moreo Fernandez d8e2f7556e QuaNet added, two examples of TextClassifiers added (CNN, LSTM) 2020-12-29 20:33:59 +01:00
Alejandro Moreo Fernandez 3ec711c96e model selection for quantification added 2020-12-23 11:14:35 +01:00
Alejandro Moreo Fernandez 7bed93dcbf added model selection for quantification 2020-12-22 17:43:23 +01:00
Alejandro Moreo Fernandez 7d6f523e4b uniform sampling added if *prevs is empty 2020-12-17 18:17:17 +01:00
Alejandro Moreo Fernandez 71949e9a03 cleaning 2020-12-15 15:20:35 +01:00
Alejandro Moreo Fernandez 649d412389 dataset fetch for polarity reviews (hp, kindle, imdb) and twitter sentiment (11 datasets) added 2020-12-14 18:36:19 +01:00
Alejandro Moreo Fernandez c8a1a70c8a refactoring aggregative methods as methods that not only implement 'classify' and 'quantify', but that also implement 'aggregate' and that, by default, have a default implementation of 'quantify' as a pipeline of 'classify' and 'aggregate'; this helps speeding up evaluations A LOT, since the documents can be pre-classified and the samples are carried out across pre-classified values (labels, or posterior probabilities), and thus only aggregate is called many times within the artificial sampling protocol 2020-12-11 19:28:17 +01:00
Alejandro Moreo Fernandez e55caf82fd merged 2020-12-10 19:08:22 +01:00
Alejandro Moreo Fernandez 9bc3a9f28a evaluation by artificial prevalence sampling added. New methods added. New util functions added to quapy.functional and quapy.utils 2020-12-10 19:04:33 +01:00
Alejandro Moreo Fernandez 2361186a01 aggregation methods updated 2020-12-09 12:46:50 +01:00
Alejandro Moreo Fernandez 9c8d29156c aggregative methods adapted. Explicit loss minimization methods (SVMQ, SVMKLD, ...) added and with support to binary or single-label. HDy added 2020-12-04 19:32:08 +01:00
Alejandro Moreo Fernandez a882424eeb many aggregative methods added 2020-12-03 18:12:28 +01:00
Alejandro Moreo Fernandez 78988c87f9 svmperf wrapper added 2020-12-03 16:59:13 +01:00
Alejandro Moreo Fernandez e81009e665 fixing dataset loading 2020-12-03 16:36:54 +01:00
Alejandro Moreo Fernandez b6820e8dba data loading 2020-12-03 16:24:21 +01:00