From 9b9aed893e4ef8a714ecb326efb8d9a27148d714 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Alejandro=20Moreo=20Fern=C3=A1ndez?=
 <alejandro.moreo@isti.cnr.it>
Date: Fri, 3 Apr 2020 11:34:30 +0200
Subject: [PATCH 1/3] Update README.md

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 566133a..a6bb24e 100755
--- a/README.md
+++ b/README.md
@@ -15,8 +15,8 @@ The experiments have been run using the following packages (older versions might
 
 
 ## Disclaimer:
-The dataset is not distributed in this version. We have asked the Editors for permission to publish the corpus.
-We are waiting for some of these responses to arrive.
+The dataset is not distributed in this version. We have asked the Editors of each document for permission to publish the corpus.
+We are waiting for some of these responses to arrive. 
 
 ## Running the Experiments
 The script in __./src/author_identification.py__ executes the experiments. This is the script syntax (--help):

From c77ac59331279b534f418ea39490555a82faf423 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Alejandro=20Moreo=20Fern=C3=A1ndez?=
 <alejandro.moreo@isti.cnr.it>
Date: Mon, 6 Apr 2020 12:07:55 +0200
Subject: [PATCH 2/3] Update README.md

---
 README.md | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/README.md b/README.md
index a6bb24e..a7d4140 100755
--- a/README.md
+++ b/README.md
@@ -1,8 +1,6 @@
 # Authorship Verification for Medieval Latin 
 
-Code to reproduce the experiments reported in the papers
-["The Epistle to Cangrande Through the Lens of Computational Authorship Verification"](https://link.springer.com/chapter/10.1007/978-3-030-30754-7_15)
-and 
+Code to reproduce the experiments reported in the paper
 ["L’Epistola a Cangrande al vaglio della Computational Authorship Verification: Risultati preliminari (con una postilla sulla cosiddetta XIV Epistola di Dante Alighieri)"](https://www.academia.edu/42297516/L_Epistola_a_Cangrande_al_vaglio_della_Computational_Authorship_Verification_risultati_preliminari_con_una_postilla_sulla_cosiddetta_XIV_Epistola_di_Dante_Alighieri_in_Nuove_inchieste_sull_Epistola_a_Cangrande_a_c._di_A._Casadei_Pisa_Pisa_University_Press_pp._153-192)
 
 ## Requirements:

From f0e596c1accb827b779975866ec5dc28fec7b126 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Alejandro=20Moreo=20Fern=C3=A1ndez?=
 <alejandro.moreo@isti.cnr.it>
Date: Mon, 15 Jun 2020 18:50:39 +0200
Subject: [PATCH 3/3] Update README.md

---
 README.md | 18 ++++++++----------
 1 file changed, 8 insertions(+), 10 deletions(-)

diff --git a/README.md b/README.md
index a7d4140..53cb23d 100755
--- a/README.md
+++ b/README.md
@@ -11,10 +11,8 @@ The experiments have been run using the following packages (older versions might
 * scikit-learn==0.22.2.post1
 * scipy==1.4.1
 
-
-## Disclaimer:
-The dataset is not distributed in this version. We have asked the Editors of each document for permission to publish the corpus.
-We are waiting for some of these responses to arrive. 
+## Dataset:
+The dataset can be downloaded from [http://hlt.isti.cnr.it/medlatin/](http://hlt.isti.cnr.it/medlatin/).
 
 ## Running the Experiments
 The script in __./src/author_identification.py__ executes the experiments. This is the script syntax (--help):
@@ -42,7 +40,7 @@ optional arguments:
 The following command line:
 ```
 cd src
-python author_identification.py ../Corpora/CorpusI Dante --unknown ../Epistle/EpistolaXIII_1.txt
+python author_identification.py ../Corpora/MedLatin1 Dante --unknown ../Epistle/EpistolaXIII_1.txt
 ```
 
 Will use all texts in ../Corpora/CorpusI as training documents to train a verificator for the 
@@ -54,18 +52,18 @@ to the positive class.
 Similarly, the command line:
 ```
 cd src
-python author_identification.py ../Corpora/CorpusI ALL --loo 
+python author_identification.py ../Corpora/MedLatin1 ALL --loo 
 ```
 will perform a cross-validation of the binary classifier for all authors using all training documents in a leave-one-out (LOO) fashion.
 
 The script will report the results both in the standard output (more elaborated) and in a log file. For example, the last command will produce a log file containing:
 ```
-F1 for ClaraAssisiensis = 0.400
+F1 for ClaraAssisiensis = 0.571
 F1 for Dante = 0.957
 F1 for GiovanniBoccaccio = 1.000
-F1 for GuidoFaba = 0.974
+F1 for GuidoFaba = 0.980
 F1 for PierDellaVigna = 0.993
-LOO Macro-F1 = 0.865
-LOO Micro-F1 = 0.981
+LOO Macro-F1 = 0.900
+LOO Micro-F1 = 0.985
 ```
 (Note that small numerical variations with respect to the original papers might occur due to different software versions and as a result from any stochastic underlying process. Those changes should anyway not alter the conclusions derived from the published results.)