Commit Graph

11 Commits

Author SHA1 Message Date
andrea 8940c99102 also saving n_components if auto optimizing it
removed some unnecessary columns from result csv
2019-12-12 14:33:41 +01:00
andrea dd34a96f87 also saving n_components if auto optimizing it
removed some unnecessary columns from result csv
2019-12-12 11:18:38 +01:00
andrea 0c6056e7a1 refactored pca methods 2019-12-09 15:39:39 +01:00
andrea 9fa1899a7f refactored pca methods 2019-12-09 15:37:52 +01:00
andrea 509289b268 Plot variance explained by PCA for every language 2019-12-04 13:24:11 +01:00
andrea ba1a72ff94 Plot variance explained by PCA for every language 2019-12-04 10:16:17 +01:00
andrea f074fd97f9 get_optimal_supervised_components method - to be polished 2019-12-03 19:57:11 +01:00
andrea 4de6b3e250 refactoring emebed method into Class StorageEmbeddings. refactoring class EmbeddingsAligned.
tSVD and T-SNE for supervised embeddings
2019-12-03 15:34:12 +01:00
andrea f2083bf22a reworked unsupervised (aligned) embeddings loader method and class (fastText and MUSE).
new op.arg -t ['MUSE', 'FastText']
uploaded /results/results.csv (on rcv1 ... run0.pickle) obtained on all available setup.
TODO: refactor it also as a standalone class with its own load/weighted sum/extract/reduce methods.
2019-11-30 19:14:51 +01:00
andrea 601da33836 fixed embeddings remote import and dataset name in logfile 2019-11-28 23:10:14 +01:00
andrea 2a5d0243db first commit 2019-11-28 18:39:19 +01:00