Comments (5)
Thank you for the response! After checking your eventalign.txt closely, it seems that your nanopolish run does not produce the column start_idx and end_idx that is required by m6anet to preprocess the file. You need to include --signal-index option in your nanopolish eventalign command, let me know if it is working for you!
from m6anet.
Hi @FraPria, thanks for raising this issue. Can I check whether data.json contains all NaN, or if those are the only entries with NaN? Can you show me the output of data.index as well? If data.index output is fine, then do you think you can give me a minimal reproducible example that results in this error? Perhaps you can extract eventalign outputs only from those positions with NaN entries
I don't expect m6anet-run_inference to run successfully given those outputs but thanks for raising this, in the future I will include some checks to make sure that it will not run if there is a problem with data.json
from m6anet.
Hi @chrishendra93, thanks for your answer!
the output of data.index is like this:
transcript_id,transcript_position,start,end
68ebd370-cef0-4b97-bd1b-825d623446e2_chr1:18000,2170,0,113
68ebd370-cef0-4b97-bd1b-825d623446e2_chr1:18000,3061,113,226
68ebd370-cef0-4b97-bd1b-825d623446e2_chr1:18000,3153,226,377
68ebd370-cef0-4b97-bd1b-825d623446e2_chr1:18000,3159,377,528
68ebd370-cef0-4b97-bd1b-825d623446e2_chr1:18000,3565,528,641
c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463,101,641,986
c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463,156,986,1293
c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463,194,1293,1600
c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463,228,1600,2021
Here I attach a subset of the eventalign output
eventalign.txt
Running the command:
m6anet-dataprep --eventalign eventalign.txt --out_dir .
I get this output:
{"68ebd370-cef0-4b97-bd1b-825d623446e2_chr1:18000":{"2170":{"GAGACAT":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"68ebd370-cef0-4b97-bd1b-825d623446e2_chr1:18000":{"3153":{"GTGACCC":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"68ebd370-cef0-4b97-bd1b-825d623446e2_chr1:18000":{"3159":{"CTGACTT":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463":{"101":{"AAGACAC":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463":{"194":{"CTGACCA":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463":{"228":{"AAAACAA":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463":{"303":{"ATGACAG":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463":{"398":{"GTAACAA":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463":{"418":{"TAAACTG":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
{"c6e29c47-9e37-427a-81d5-70e87a061505_ENSG00000228463":{"436":{"CTAACTC":[[NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN,NaN]]}}}
It outputs also a warning:
site-packages/m6anet-0.1.1-py3.7.egg/m6anet/scripts/dataprep.py:143: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
Thank you
from m6anet.
That was the problem, now it gives the expected output, thanks a lot!
from m6anet.
Hi @FraPria, glad that you can run the model successfully. I will update the documentation with this instruction just in case so that people know what commands to supply to Nanopolish in order to run the model successfully. I will now close this issue, let us know if you have any more problems running the model!
from m6anet.
Related Issues (20)
- Identification of m6A with the SG-NEx samples HOT 4
- About the RNA04 dataset to train the model
- Arabidopsis m6anet model for RNA004 HOT 2
- Inference JSONDecoderError HOT 2
- m6Anet inference output missing probability_modified values HOT 8
- Question about the PCA in the paper
- Memory usage exploding HOT 3
- Empty Eventalign.txt for Negative Sense Viral RNA HOT 1
- dataprep and data.log HOT 8
- Questions about the m6A motif HOT 1
- Genome vs transcriptome alignment HOT 4
- m6anet, eventalign, galaxy HOT 3
- error with m6anet dataprep HOT 1
- yet another error with m6anet dataprep HOT 1
- running with test eventalign test file file form github produces another error HOT 8
- installation on linux failed HOT 3
- TypeError: argument should be integer or None, not 'numpy.float64' HOT 2
- Interpreting 'inference' results HOT 1
- m6anet dataprep fails to complete: cannot convert float NaN to integer HOT 3
- Issues with Read Selection HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from m6anet.