andrea
8940c99102
also saving n_components if auto optimizing it
...
removed some unnecessary columns from result csv
2019-12-12 14:33:41 +01:00
andrea
dd34a96f87
also saving n_components if auto optimizing it
...
removed some unnecessary columns from result csv
2019-12-12 11:18:38 +01:00
andrea
0c6056e7a1
refactored pca methods
2019-12-09 15:39:39 +01:00
andrea
9fa1899a7f
refactored pca methods
2019-12-09 15:37:52 +01:00
andrea
509289b268
Plot variance explained by PCA for every language
2019-12-04 13:24:11 +01:00
andrea
ba1a72ff94
Plot variance explained by PCA for every language
2019-12-04 10:16:17 +01:00
andrea
f074fd97f9
get_optimal_supervised_components method - to be polished
2019-12-03 19:57:11 +01:00
andrea
4de6b3e250
refactoring embed method into class StorageEmbeddings. refactoring class EmbeddingsAligned.
...
tSVD and t-SNE for supervised embeddings
2019-12-03 15:34:12 +01:00
andrea
f2083bf22a
reworked unsupervised (aligned) embeddings loader method and class (fastText and MUSE).
...
new op.arg -t ['MUSE', 'FastText']
uploaded /results/results.csv (on rcv1 ... run0.pickle) obtained on all available setups.
TODO: refactor it also as a standalone class with its own load/weighted sum/extract/reduce methods.
2019-11-30 19:14:51 +01:00
andrea
601da33836
fixed embeddings remote import and dataset name in logfile
2019-11-28 23:10:14 +01:00
andrea
2a5d0243db
first commit
2019-11-28 18:39:19 +01:00