1
0
Fork 0

readme updated

This commit is contained in:
Alejandro Moreo Fernandez 2021-02-08 19:16:43 +01:00
parent 91f8d8f3e1
commit 98b6e2b82d
2 changed files with 6 additions and 6 deletions

View File

@ -42,7 +42,7 @@ we could assume the IID assumption to hold, as this prevalence would simply coin
class prevalence of the training set. That is to say, a Quantification model class prevalence of the training set. That is to say, a Quantification model
should be tested across samples characterized by different class prevalences. should be tested across samples characterized by different class prevalences.
QuaPy implements sampling procedures and evaluation protocols that automates this endeavour. QuaPy implements sampling procedures and evaluation protocols that automates this endeavour.
See the Wiki for detailed examples. See the [Wiki](https://github.com/HLT-ISTI/QuaPy/wiki) for detailed examples.
## Features ## Features
@ -60,7 +60,7 @@ SVM-based variants for quantification, HDy, QuaNet, and Ensembles).
## Requirements ## Requirements
* sklearnm, numpy, scipy * scikit-learn, numpy, scipy
* pytorch (for QuaNet) * pytorch (for QuaNet)
* svmperf patched for quantification (see below) * svmperf patched for quantification (see below)
* joblib * joblib
@ -92,5 +92,8 @@ for quantification.
This patch extends the former by also allowing SVMperf to optimize for This patch extends the former by also allowing SVMperf to optimize for
_AE_ and _RAE_. _AE_ and _RAE_.
## Wiki
Check our [Wiki](https://github.com/HLT-ISTI/QuaPy/wiki) in which many examples
are provided.

View File

@ -81,9 +81,6 @@ def standardize(dataset: Dataset, inplace=True):
return Dataset(training, test, dataset.vocabulary, dataset.name) return Dataset(training, test, dataset.vocabulary, dataset.name)
def index(dataset: Dataset, min_df=5, inplace=False, **kwargs): def index(dataset: Dataset, min_df=5, inplace=False, **kwargs):
""" """
Indexes a dataset of strings. To index a document means to replace each different token by a unique numerical index. Indexes a dataset of strings. To index a document means to replace each different token by a unique numerical index.