Carlos-Francisco Méndez-Cruz

Feature extraction and vectorization of three sentences

Vectorizer: t
['agn43', 'be', 'by', 'of', 'oxyr', 'repress', 'repression', 'the', 'transcription']
[[0.32999531 0.55873062 0.42492904 0. 0.32999531 0.42492904 0. 0. 0.32999531]
[0.46333427 0. 0. 0. 0.46333427 0.59662724 0. 0. 0.46333427]
[0.27463443 0. 0.35364183 0.46499651 0.27463443 0. 0.46499651 0.46499651 0.27463443]]
[[1. 0.71221865 0.4221569 ]
[0.71221865 1. 0.38174263]
[0.4221569 0.38174263 1. ]]
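The 3 x 3 matrix above is consistent with pairwise cosine similarity between the rows of the preceding document-term matrix. The sketch below recomputes it from the printed rows. The names `tfidf_rows` and `cosine` are illustrative, not taken from the original script, and the values are copied from the output, so agreement is limited to the printed precision.

```python
import math

# Rows copied from the first matrix printed above (one row per sentence,
# one column per feature: agn43, be, by, of, oxyr, repress, repression,
# the, transcription).
tfidf_rows = [
    [0.32999531, 0.55873062, 0.42492904, 0.0, 0.32999531,
     0.42492904, 0.0, 0.0, 0.32999531],
    [0.46333427, 0.0, 0.0, 0.0, 0.46333427,
     0.59662724, 0.0, 0.0, 0.46333427],
    [0.27463443, 0.0, 0.35364183, 0.46499651, 0.27463443,
     0.0, 0.46499651, 0.46499651, 0.27463443],
]


def cosine(u, v):
    """Cosine similarity: dot product divided by the product of the norms."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Pairwise similarity between every pair of sentences.
similarity = [[cosine(u, v) for v in tfidf_rows] for u in tfidf_rows]

for row in similarity:
    print([round(x, 6) for x in row])
```

Because each row already has unit length, the cosine reduces to a plain dot product here. The explicit norms are kept so the function also works on unnormalized rows such as the binary matrix.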
Vectorizer: t
['dt', 'gene', 'in', 'nn', 'tf', 'vbn', 'vbz']
[[0. 0.32999531 0.42492904 0.32999531 0.32999531 0.55873062 0.42492904]
[0.43007025 0.25400642 0.65415902 0.50801283 0.25400642 0. 0.]
[0. 0.46333427 0. 0.46333427 0.46333427 0. 0.59662724]]
[[1. 0.61325487 0.71221865]
[0.61325487 1. 0.47075951]
[0.71221865 0.47075951 1. ]]
Vectorizer: t
['dt', 'in', 'nn', 'vbn', 'vbz']
[[0. 0. 0.91892665 0. 0.39442846]
[0.33529805 0.51000562 0.79212972 0. 0. ]
[0. 0.33046836 0.76991449 0.43452618 0.33046836]]
[[1. 0.72790911 0.83784107]
[0.72790911 1. 0.77841287]
[0.83784107 0.77841287 1. ]]
Vectorizer: b
['agn43', 'by', 'is', 'of', 'oxyr', 'repressed', 'represses', 'repression', 'the', 'transcription']
[[1. 0. 0. 0. 1. 0. 1. 0. 0. 1.]
[1. 1. 0. 1. 1. 0. 0. 1. 1. 1.]
[1. 1. 1. 0. 1. 1. 0. 0. 0. 1.]]
[[1. 0.56694671 0.61237244]
[0.56694671 1. 0.6172134 ]
[0.61237244 0.6172134 1. ]]
Vectorizer: t
['agn43', 'by', 'is', 'of', 'oxyr', 'repressed', 'represses', 'repression', 'the', 'transcription']
[[0.41285857 0. 0. 0. 0.41285857 0. 0.69903033 0. 0. 0.41285857]
[0.27463443 0.35364183 0. 0.46499651 0.27463443 0. 0. 0.46499651 0.46499651 0.27463443]
[0.31021184 0.39945423 0.52523431 0. 0.31021184 0.52523431 0. 0. 0. 0.31021184]]
[[1. 0.34015553 0.38422086]
[0.34015553 1. 0.39684828]
[0.38422086 0.39684828 1. ]]
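The last two blocks (`Vectorizer: b` and `Vectorizer: t` over word forms) can be reproduced with a short, dependency-free sketch. Several things here are assumptions: the three sentences are hypothetical, reconstructed from the printed feature names (bag-of-words features do not depend on word order), `b` is read as binary presence/absence, and `t` is read as TF-IDF with smoothed idf, `ln((1 + n) / (1 + df)) + 1`, followed by L2 normalization, which is the default weighting in scikit-learn's `TfidfVectorizer`. The original script is not shown, so this is one plausible reading rather than the author's implementation.

```python
import math
import re

# Hypothetical sentences, reconstructed from the printed feature names.
sentences = [
    "OxyR represses agn43 transcription",
    "The repression of agn43 transcription by OxyR",
    "agn43 transcription is repressed by OxyR",
]


def tokenize(text):
    """Lowercase and keep tokens of two or more word characters."""
    return re.findall(r"\b\w\w+\b", text.lower())


docs = [tokenize(s) for s in sentences]
vocabulary = sorted({tok for doc in docs for tok in doc})


def l2_normalize(vec):
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def binary_vector(doc):
    """1.0 if the term occurs in the sentence, else 0.0."""
    return [1.0 if term in doc else 0.0 for term in vocabulary]


# Smoothed inverse document frequency (assumed weighting scheme).
n_docs = len(docs)
df = {t: sum(t in doc for doc in docs) for t in vocabulary}
idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1 for t in vocabulary}


def tfidf_vector(doc):
    """Term count times idf, scaled to unit length."""
    return l2_normalize([doc.count(t) * idf[t] for t in vocabulary])


def cosine_matrix(rows):
    """Pairwise cosine similarity between all rows."""
    unit = [l2_normalize(r) for r in rows]
    return [[sum(a * b for a, b in zip(u, v)) for v in unit] for u in unit]


binary = [binary_vector(d) for d in docs]
tfidf = [tfidf_vector(d) for d in docs]

print(vocabulary)
for name, rows in (("b", binary), ("t", tfidf)):
    print("Vectorizer:", name)
    for row in cosine_matrix(rows):
        print([round(x, 8) for x in row])
```

Under these assumptions the TF-IDF similarities come out lower than the binary ones because the terms shared by all three sentences (`agn43`, `oxyr`, `transcription`) receive the smallest idf weight, so the overlap between sentences counts for less.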