Toggle navigation
Toggle navigation
This project
Loading...
Sign in
Carlos-Francisco Méndez-Cruz
/
deep-learning-workshop
Go to a project
Toggle navigation
Toggle navigation pinning
Projects
Groups
Snippets
Help
Project
Activity
Repository
Pipelines
Graphs
Issues
0
Merge Requests
0
Wiki
Snippets
Network
Create a new issue
Builds
Commits
Issue Boards
Authored by
Carlos-Francisco Méndez-Cruz
2019-05-08 13:37:35 -0500
Browse Files
Options
Browse Files
Download
Email Patches
Plain Diff
Commit
2bfdcdf91dd4689686c93961e74f5efff6e88e03
2bfdcdf9
1 parent
d60f92ab
Deep Learning Workshop
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
7 additions
and
6 deletions
data-sets/get-hga-training-test-py27.py
data-sets/get-hga-training-test-py27.py
View file @
2bfdcdf
...
...
@@ -86,15 +86,16 @@ if __name__ == "__main__":
print
(
"Max exon length: {}"
.
format
(
max_exon_length
))
print
(
"Max utr length: {}"
.
format
(
max_utr_length
))
if
max_exon_length
>
max_utr_length
:
max_length
=
max_exon_length
else
:
max_length
=
max_utr_length
# Fill sequence with X char to get max length
# One-hot-encoding of sequences
for
sequence
,
label
in
zip
(
sequences
,
labels
):
if
label
==
"exon"
:
if
len
(
sequence
)
<
max_exon_length
:
sequence_adjust
=
sequence
.
ljust
(
max_exon_length
+
len
(
sequence
),
'X'
)
elif
label
==
"utr"
:
if
len
(
sequence
)
<
max_utr_length
:
sequence_adjust
=
sequence
.
ljust
(
max_utr_length
+
len
(
sequence
),
'X'
)
if
len
(
sequence
)
<
max_length
:
sequence_adjust
=
sequence
.
ljust
(
max_length
+
len
(
sequence
),
'X'
)
print
(
"Length sequence_adjust: {}"
.
format
(
len
(
sequence_adjust
)))
integer_encoded
=
integer_encoder
.
fit_transform
(
list
(
sequence_adjust
))
integer_encoded
=
np
.
array
(
integer_encoded
)
.
reshape
(
-
1
,
1
)
...
...
Please
register
or
login
to post a comment