Ignacio Arroyo Fernández

Update README.md

...@@ -7,6 +7,7 @@ The main method follows the next pipeline: ...@@ -7,6 +7,7 @@ The main method follows the next pipeline:
7 ### Training mode 7 ### Training mode
8 - Parse abstracts from two input files (classA and classB; see files format at the `data/` directory) 8 - Parse abstracts from two input files (classA and classB; see files format at the `data/` directory)
9 - Transform abstracts into their TFIDF sparse representations 9 - Transform abstracts into their TFIDF sparse representations
10 +- Transform TFIDF representations into their 200-dimensional SVD approximation and save it at `model_binClass/svd_model.pkl`
10 - Train Support Vector Machines with different parameters by using GridSearch 11 - Train Support Vector Machines with different parameters by using GridSearch
11 - Select the best estimator and save it at `model_binClass/svm_model.pkl` (default) 12 - Select the best estimator and save it at `model_binClass/svm_model.pkl` (default)
12 - Save TFIDF transformation for keeping the training vocabulary (stored at `model_binClass/tfidf_model.pkl`) 13 - Save TFIDF transformation for keeping the training vocabulary (stored at `model_binClass/tfidf_model.pkl`)
......