nohup.out 1.45 KB
/usr/local/lib/python3.6/dist-packages/fuzzywuzzy/fuzz.py:11: UserWarning: Using slow pure-python SequenceMatcher. Install python-Levenshtein to remove this warning
  warnings.warn('Using slow pure-python SequenceMatcher. Install python-Levenshtein to remove this warning')


-------------------------------- PARAMETERS --------------------------------

--inputPath          Path of npl tagged file (crf output): /home/egaytan/automatic-extraction-growth-conditions/mapping_MCO/input/
--iAnnotatedFile     Input file of npl tagged file (crf output: srr_htregulondb_model_Run3_v10_S1_False_S2_True_S3_False_S4_False_Run3_v10.tsv
--iOntoFile          Input file with the ontology entities (MCO-terms): gc_ontology_terms_v2.txt
--iLinksFile         Input file with links and id for the ontology (MCO-type-links): None
--iSynFile           Input file for the additional ontology of synonyms (MCO-syn-json): mco_terms_v0.2.json
--outputPath         Output path to place output files: /home/egaytan/automatic-extraction-growth-conditions/mapping_MCO/output/
--outputFile         Output of the mapping process: srr_htregulondb_mapped.tsv
--minPerMatch        Minimal string matching percentage: 80
--minCRFProbs        Minimal crf probabilities allowed: 0.9



set()
{'SRR  GSE   GSM GPL  PMID  GSM_NAME  FULLTEXT  BANGLINE  SOURCE_TEXT_CTRL  FULL_TEXT  TERM_NAME TERM_TYPE   PROB'}
GSE, BANGLINE, GSM, FULL_TEXT, PROB, TERM_NAME, GPL, SRR, TERM_TYPE, PMID expected columns for iAnnotatedFile