Carlos-Francisco Méndez-Cruz

Feature extraction and vectorizer three sentences

1 +the repression of agn43 transcription by oxyR
2 +
1 +the repression of agn43 transcription by OxyR
2 +
1 +Vectorizer: t
2 +['agn43', 'be', 'by', 'of', 'oxyr', 'repress', 'repression', 'the', 'transcription']
3 +[[0.32999531 0.55873062 0.42492904 0. 0.32999531 0.42492904
4 + 0. 0. 0.32999531]
5 + [0.46333427 0. 0. 0. 0.46333427 0.59662724
6 + 0. 0. 0.46333427]
7 + [0.27463443 0. 0.35364183 0.46499651 0.27463443 0.
8 + 0.46499651 0.46499651 0.27463443]]
9 +[[1. 0.71221865 0.4221569 ]
10 + [0.71221865 1. 0.38174263]
11 + [0.4221569 0.38174263 1. ]]
...\ No newline at end of file ...\ No newline at end of file
1 +Vectorizer: t
2 +['dt', 'gene', 'in', 'nn', 'tf', 'vbn', 'vbz']
3 +[[0. 0.32999531 0.42492904 0.32999531 0.32999531 0.55873062
4 + 0.42492904]
5 + [0.43007025 0.25400642 0.65415902 0.50801283 0.25400642 0.
6 + 0. ]
7 + [0. 0.46333427 0. 0.46333427 0.46333427 0.
8 + 0.59662724]]
9 +[[1. 0.61325487 0.71221865]
10 + [0.61325487 1. 0.47075951]
11 + [0.71221865 0.47075951 1. ]]
...\ No newline at end of file ...\ No newline at end of file
1 +Vectorizer: t
2 +['dt', 'in', 'nn', 'vbn', 'vbz']
3 +[[0. 0. 0.91892665 0. 0.39442846]
4 + [0.33529805 0.51000562 0.79212972 0. 0. ]
5 + [0. 0.33046836 0.76991449 0.43452618 0.33046836]]
6 +[[1. 0.72790911 0.83784107]
7 + [0.72790911 1. 0.77841287]
8 + [0.83784107 0.77841287 1. ]]
...\ No newline at end of file ...\ No newline at end of file
1 +Vectorizer: b
2 +['agn43', 'by', 'is', 'of', 'oxyr', 'repressed', 'represses', 'repression', 'the', 'transcription']
3 +[[1. 0. 0. 0. 1. 0. 1. 0. 0. 1.]
4 + [1. 1. 0. 1. 1. 0. 0. 1. 1. 1.]
5 + [1. 1. 1. 0. 1. 1. 0. 0. 0. 1.]]
6 +[[1. 0.56694671 0.61237244]
7 + [0.56694671 1. 0.6172134 ]
8 + [0.61237244 0.6172134 1. ]]
...\ No newline at end of file ...\ No newline at end of file
1 +Vectorizer: t
2 +['agn43', 'by', 'is', 'of', 'oxyr', 'repressed', 'represses', 'repression', 'the', 'transcription']
3 +[[0.41285857 0. 0. 0. 0.41285857 0.
4 + 0.69903033 0. 0. 0.41285857]
5 + [0.27463443 0.35364183 0. 0.46499651 0.27463443 0.
6 + 0. 0.46499651 0.46499651 0.27463443]
7 + [0.31021184 0.39945423 0.52523431 0. 0.31021184 0.52523431
8 + 0. 0. 0. 0.31021184]]
9 +[[1. 0.34015553 0.38422086]
10 + [0.34015553 1. 0.39684828]
11 + [0.38422086 0.39684828 1. ]]
...\ No newline at end of file ...\ No newline at end of file