I installed WGD with all prerequisites. I tried the following command after preparing the mcl file using the
[ashermoshe@login-0-0 ~/dorothee]$ wgd ksd schlosseri.mcl Botryllus_schlosseri.fas
command, but I got an error somewhere downstream.
I would appreciate your help in understanding how to approach it.
2019-02-27 17:18:23: INFO
2019-02-27 17:18:23: INFO codeml found
2019-02-27 17:18:23: INFO MUSCLE v3.7 by Robert C. Edgar
2019-02-27 17:18:23: INFO
2019-02-27 17:18:23: WARNING Output directory exists, will possibly overwrite
2019-02-27 17:18:24: INFO Translating CDS file
100% (65587 of 65587) |##################################################################################################################| Elapsed Time: 0:00:17 Time: 0:00:17
2019-02-27 17:18:42: WARNING There were 0 warnings during translation
2019-02-27 17:18:42: INFO Started whole paranome Ks analysis
2019-02-27 17:18:42: WARNING Filtered out the 5 largest gene families because n*(n-1)/2 > `max_pairwise`
2019-02-27 17:18:42: WARNING If you want to analyse these large families anyhow, please raise the `max_pairwise` parameter.
2019-02-27 17:18:42: INFO Started analysis in parallel (n_threads = 4)
2019-02-27 17:18:42: INFO Performing analysis on gene family GF_000006
2019-02-27 17:18:43: INFO Performing analysis on gene family GF_000007
2019-02-27 17:18:43: INFO Performing analysis on gene family GF_000008
2019-02-27 17:18:43: INFO Performing analysis on gene family GF_000009
2019-02-27 17:18:49: INFO Performing analysis on gene family GF_000010
2019-02-27 17:19:12: INFO Performing analysis on gene family GF_000011
2019-02-27 17:19:13: INFO Performing analysis on gene family GF_000012
2019-02-27 17:21:05: INFO Performing analysis on gene family GF_000013
2019-02-27 17:21:57: INFO Performing analysis on gene family GF_000014
2019-02-27 17:22:08: INFO Performing analysis on gene family GF_000015
2019-02-27 17:22:35: INFO Performing analysis on gene family GF_000016
2019-02-27 17:23:03: INFO Performing analysis on gene family GF_000017
2019-02-27 17:23:22: INFO Performing analysis on gene family GF_000018
2019-02-27 17:23:50: INFO Performing analysis on gene family GF_000019
2019-02-27 17:24:02: INFO Performing analysis on gene family GF_000020
2019-02-27 17:26:30: INFO Performing analysis on gene family GF_000021
2019-02-27 17:26:35: INFO Performing analysis on gene family GF_000022
2019-02-27 17:26:40: INFO Performing analysis on gene family GF_000023
2019-02-27 17:26:42: INFO Performing analysis on gene family GF_000024
2019-02-27 17:26:50: INFO Performing analysis on gene family GF_000025
2019-02-27 17:26:59: INFO Performing analysis on gene family GF_000026
2019-02-27 17:39:17: INFO Performing analysis on gene family GF_000027
2019-02-27 17:39:17: INFO Performing analysis on gene family GF_000028
2019-02-27 17:40:25: INFO Performing analysis on gene family GF_000029
2019-02-27 17:40:52: INFO Performing analysis on gene family GF_000030
2019-02-27 17:41:04: INFO Performing analysis on gene family GF_000031
2019-02-27 17:41:09: INFO Performing analysis on gene family GF_000032
2019-02-27 17:48:02: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN77472_c4_g3::TRINITY_DN77472_c4_g3_i12::g.41868::m.41868 - Botryllus_schlosseri_TRINITY_DN77472_c4_g3::TRINITY_DN77472_c4_g3_i7::g.41841::m.41841!
2019-02-27 17:48:02: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN82389_c1_g1::TRINITY_DN82389_c1_g1_i1::g.79015::m.79015 - Botryllus_schlosseri_TRINITY_DN82597_c2_g1::TRINITY_DN82597_c2_g1_i1::g.145711::m.145711!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN82579_c2_g2::TRINITY_DN82579_c2_g2_i2::g.147899::m.147899 - Botryllus_schlosseri_TRINITY_DN82597_c2_g1::TRINITY_DN82597_c2_g1_i1::g.145711::m.145711!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN82579_c2_g2::TRINITY_DN82579_c2_g2_i2::g.147899::m.147899 - Botryllus_schlosseri_TRINITY_DN82389_c1_g1::TRINITY_DN82389_c1_g1_i1::g.79015::m.79015!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN79293_c3_g3::TRINITY_DN79293_c3_g3_i1::g.306985::m.306985 - Botryllus_schlosseri_TRINITY_DN70319_c5_g1::TRINITY_DN70319_c5_g1_i6::g.86980::m.86980!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN80015_c3_g1::TRINITY_DN80015_c3_g1_i11::g.69428::m.69428 - Botryllus_schlosseri_TRINITY_DN83038_c1_g1::TRINITY_DN83038_c1_g1_i3::g.203559::m.203559!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN72881_c1_g2::TRINITY_DN72881_c1_g2_i9::g.188509::m.188509 - Botryllus_schlosseri_TRINITY_DN76803_c3_g2::TRINITY_DN76803_c3_g2_i4::g.291848::m.291848!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN82877_c1_g1::TRINITY_DN82877_c1_g1_i7::g.185179::m.185179 - Botryllus_schlosseri_TRINITY_DN83038_c1_g1::TRINITY_DN83038_c1_g1_i3::g.203559::m.203559!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN82877_c1_g1::TRINITY_DN82877_c1_g1_i7::g.185179::m.185179 - Botryllus_schlosseri_TRINITY_DN76803_c3_g2::TRINITY_DN76803_c3_g2_i4::g.291848::m.291848!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN82877_c1_g1::TRINITY_DN82877_c1_g1_i7::g.185179::m.185179 - Botryllus_schlosseri_TRINITY_DN80015_c3_g1::TRINITY_DN80015_c3_g1_i11::g.69428::m.69428!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN65697_c0_g1::TRINITY_DN65697_c0_g1_i1::g.293174::m.293174 - Botryllus_schlosseri_TRINITY_DN83038_c1_g1::TRINITY_DN83038_c1_g1_i3::g.203559::m.203559!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN65697_c0_g1::TRINITY_DN65697_c0_g1_i1::g.293174::m.293174 - Botryllus_schlosseri_TRINITY_DN80015_c3_g1::TRINITY_DN80015_c3_g1_i11::g.69428::m.69428!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN65697_c0_g1::TRINITY_DN65697_c0_g1_i1::g.293174::m.293174 - Botryllus_schlosseri_TRINITY_DN82877_c1_g1::TRINITY_DN82877_c1_g1_i7::g.185179::m.185179!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN65038_c0_g1::TRINITY_DN65038_c0_g1_i1::g.106604::m.106604 - Botryllus_schlosseri_TRINITY_DN79594_c0_g1::TRINITY_DN79594_c0_g1_i1::g.278650::m.278650!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN65038_c0_g1::TRINITY_DN65038_c0_g1_i1::g.106604::m.106604 - Botryllus_schlosseri_TRINITY_DN76803_c3_g2::TRINITY_DN76803_c3_g2_i4::g.291848::m.291848!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN65038_c0_g1::TRINITY_DN65038_c0_g1_i2::g.106605::m.106605 - Botryllus_schlosseri_TRINITY_DN79594_c0_g1::TRINITY_DN79594_c0_g1_i1::g.278650::m.278650!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN65038_c0_g1::TRINITY_DN65038_c0_g1_i2::g.106605::m.106605 - Botryllus_schlosseri_TRINITY_DN76803_c3_g2::TRINITY_DN76803_c3_g2_i4::g.291848::m.291848!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN65038_c0_g1::TRINITY_DN65038_c0_g1_i2::g.106605::m.106605 - Botryllus_schlosseri_TRINITY_DN65038_c0_g1::TRINITY_DN65038_c0_g1_i1::g.106604::m.106604!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN83231_c0_g1::TRINITY_DN83231_c0_g1_i1::g.112228::m.112228 - Botryllus_schlosseri_TRINITY_DN76803_c3_g2::TRINITY_DN76803_c3_g2_i4::g.291848::m.291848!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN83231_c0_g1::TRINITY_DN83231_c0_g1_i1::g.112228::m.112228 - Botryllus_schlosseri_TRINITY_DN82877_c1_g1::TRINITY_DN82877_c1_g1_i7::g.185179::m.185179!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN80868_c1_g1::TRINITY_DN80868_c1_g1_i17::g.220347::m.220347 - Botryllus_schlosseri_TRINITY_DN83038_c1_g1::TRINITY_DN83038_c1_g1_i3::g.203559::m.203559!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN80868_c1_g1::TRINITY_DN80868_c1_g1_i17::g.220347::m.220347 - Botryllus_schlosseri_TRINITY_DN76803_c3_g2::TRINITY_DN76803_c3_g2_i4::g.291848::m.291848!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN80868_c1_g1::TRINITY_DN80868_c1_g1_i17::g.220347::m.220347 - Botryllus_schlosseri_TRINITY_DN80015_c3_g1::TRINITY_DN80015_c3_g1_i11::g.69428::m.69428!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN80868_c1_g1::TRINITY_DN80868_c1_g1_i17::g.220347::m.220347 - Botryllus_schlosseri_TRINITY_DN82877_c1_g1::TRINITY_DN82877_c1_g1_i7::g.185179::m.185179!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN80868_c1_g1::TRINITY_DN80868_c1_g1_i17::g.220347::m.220347 - Botryllus_schlosseri_TRINITY_DN65697_c0_g1::TRINITY_DN65697_c0_g1_i1::g.293174::m.293174!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN80868_c1_g1::TRINITY_DN80868_c1_g1_i17::g.220347::m.220347 - Botryllus_schlosseri_TRINITY_DN83231_c0_g1::TRINITY_DN83231_c0_g1_i1::g.112228::m.112228!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN83038_c0_g1::TRINITY_DN83038_c0_g1_i2::g.203556::m.203556 - Botryllus_schlosseri_TRINITY_DN76803_c3_g2::TRINITY_DN76803_c3_g2_i4::g.291848::m.291848!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN83038_c0_g1::TRINITY_DN83038_c0_g1_i2::g.203556::m.203556 - Botryllus_schlosseri_TRINITY_DN82877_c1_g1::TRINITY_DN82877_c1_g1_i7::g.185179::m.185179!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN83038_c0_g1::TRINITY_DN83038_c0_g1_i2::g.203556::m.203556 - Botryllus_schlosseri_TRINITY_DN83231_c0_g1::TRINITY_DN83231_c0_g1_i1::g.112228::m.112228!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN83038_c0_g1::TRINITY_DN83038_c0_g1_i2::g.203556::m.203556 - Botryllus_schlosseri_TRINITY_DN80868_c1_g1::TRINITY_DN80868_c1_g1_i17::g.220347::m.220347!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN81132_c0_g1::TRINITY_DN81132_c0_g1_i4::g.47004::m.47004 - Botryllus_schlosseri_TRINITY_DN77472_c4_g3::TRINITY_DN77472_c4_g3_i7::g.41841::m.41841!
2019-02-27 17:48:03: WARNING No Ks value for Botryllus_schlosseri_TRINITY_DN81132_c0_g1::TRINITY_DN81132_c0_g1_i4::g.47004::m.47004 - Botryllus_schlosseri_TRINITY_DN77472_c4_g3::TRINITY_DN77472_c4_g3_i12::g.41868::m.41868!
2019-02-27 17:48:05: INFO Performing analysis on gene family GF_000033
2019-02-27 17:51:44: INFO Performing analysis on gene family GF_000034
2019-02-27 17:51:56: INFO Performing analysis on gene family GF_000035
multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/joblib/_parallel_backends.py", line 350, in __call__
return self.func(*args, **kwargs)
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/joblib/parallel.py", line 131, in __call__
return [func(*args, **kwargs) for func, args, kwargs in self.items]
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/joblib/parallel.py", line 131, in <listcomp>
return [func(*args, **kwargs) for func, args, kwargs in self.items]
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/wgd/ks_distribution.py", line 303, in analyse_family
results_dict, msa=msa_path_protein, method=method)
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/wgd/ks_distribution.py", line 98, in _weighting
tree_path, pairwise_estimates['Ks'])
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/wgd/phy.py", line 123, in phylogenetic_tree_to_cluster_format
t = Tree(tree)
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/ete3/coretype/tree.py", line 211, in __init__
quoted_names=quoted_node_names)
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/ete3/parser/newick.py", line 249, in read_newick
raise NewickError('Unexisting tree file or Malformed newick tree structure.')
ete3.parser.newick.NewickError: Unexisting tree file or Malformed newick tree structure.
You may want to check other newick loading flags like 'format' or 'quoted_node_names'.
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/share/apps/anaconda3-5.1.0/lib/python3.6/multiprocessing/pool.py", line 119, in worker
result = (True, func(*args, **kwds))
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/joblib/_parallel_backends.py", line 359, in __call__
raise TransportableException(text, e_type)
joblib.my_exceptions.TransportableException: TransportableException
___________________________________________________________________________
NewickError Wed Feb 27 18:10:41 2019
PID: 62472 Python 3.6.8: /share/apps/anaconda3-5.1.0/bin/python
...........................................................................
/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/joblib/parallel.py in __call__(self=<joblib.parallel.BatchedCalls object>)
126 def __init__(self, iterator_slice):
127 self.items = list(iterator_slice)
128 self._size = len(self.items)
129
130 def __call__(self):
--> 131 return [func(*args, **kwargs) for func, args, kwargs in self.items]
self.items = [(<function analyse_family>, ('GF_000012', {'Botryllus_schlosseri_TRINITY_DN47590_c0_g1::TRINITY_DN47590_c0_g1_i1::g.253656::m.253656': 'MGETKSTITNLYGHNNRAKRRTQLRELAIRTRTEKSHLIAGDFNGIDS...ARRQKDQLRSDEKEMQKVRNRVIQNQQYLSDYRKIKKKIWKEKAESLHK', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g1::TRINITY_DN50048_c0_g1_i1::g.248848::m.248848': 'GAVISIDFLKAYDSVDHSFLHNTLEEAGFGVKVRAFFKAIYQGGSAKV...SGMKGKIATPSYADDVTITLAKEEESTKALQIVAEFGKASGLQINRKKT', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g2::TRINITY_DN50048_c0_g2_i1::g.248849::m.248849': 'QKTTSQFARGIIKTIFKKGDKEDIRNYRPITILNVDYKIISKVITNRIQKVLPTITHRHQFINPPNTIGDLNLLLREVTSDMRERSRGA', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g3::TRINITY_DN50048_c0_g3_i1::g.248850::m.248850': 'AGDFNGIDDIELDRDPVNIRHDAADAKYSKRVMEVIGVTDAFRQVHGS...CNHMPCPFSDHGATTALVKLTDHRPRRPNTWKNNTKVYEMEAFETELEV', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g4::TRINITY_DN50048_c0_g4_i1::g.248851::m.248851': 'KTWADKATDLGKLQLEAKADAEEALGKRPHLLCEKIKVRRDAVSITAIKDATGKTTEDPEEIRETVEEFYQKLYSKRETDKCTANSFHRYQDAKLSTR', 'Botryllus_schlosseri_TRINITY_DN50200_c0_g1::TRINITY_DN50200_c0_g1_i1::g.61823::m.61823': 'GEDGLSSELYMVNLDLMKKELTEVYNEIYEAQGTTTSLGRAVLKIIHK...KNYRPISLLNSDYKILSKILTNRLKQALPSITHQHQHVNPPKTIGQINL', 'Botryllus_schlosseri_TRINITY_DN52423_c0_g2::TRINITY_DN52423_c0_g2_i1::g.262149::m.262149': 'GKANISIGGKLGGNIRLGRGIKQGDPISMLLFTMATDPLLQRLNHDLD...DVNITLAHQADVNEALKIIQDFEEASALKLNKNKSKGITYHPKPPPGSK', 'Botryllus_schlosseri_TRINITY_DN52423_c0_g3::TRINITY_DN52423_c0_g3_i1::g.262150::m.262150': 'PPPGSKNVLKWVQSMEVLGHVINRHPPNDHETWNGLANKAKDLMREIK...YVATLKKMPINTRRELETAVTELLFGKSMRPDYRKLIQRREAGGIGLVD', 'Botryllus_schlosseri_TRINITY_DN58171_c0_g1::TRINITY_DN58171_c0_g1_i1::g.196397::m.196397': 'LEYEIVEFLFGKGKRPEYKKLVQQEIAGGREVKDIPTITDIIFIKPAV...RTDHQLALTLGWLRERPINNSRPHTWQPRQHWAEMAKIMKEMEYKRDYI', 'Botryllus_schlosseri_TRINITY_DN61293_c1_g1::TRINITY_DN61293_c1_g1_i1::g.134659::m.134659': 'AAFVMDLDGKIEYEKIIQARESGGLELVDIPTMTDLAFVKPALRYLQR...MKRYGMRKINNAIPHVFQPLQHWQETEKTMRSLGRQQQDIKSKRRERYR', ...}, {'Botryllus_schlosseri_TRINITY_DN10080_c0_g1::TRINITY_DN10080_c0_g1_i1::g.308497::m.308497': 'CTTAATTGTGCTTCATTTTTTAGCTCTCAAATGGCTCGATCAACGAAA...CATGGCGGAAGGCACGAAGGCCGTCGCCAAGCTCGCTGCCAGCAAATAA', 'Botryllus_schlosseri_TRINITY_DN10087_c0_g1::TRINITY_DN10087_c0_g1_i1::g.308496::m.308496': 'ATGGCGGTGGTGACACTGTTCTCGGTGGGGCATTCTAGGGAGGTGCGG...CTACCTCGACCCTCGGCTAGACCAACCATGGCCCCGCGTCGCGAAGGCC', 'Botryllus_schlosseri_TRINITY_DN1013_c0_g1::TRINITY_DN1013_c0_g1_i1::g.248880::m.248880': 'TGGGCAGTCCTGACTGGCACTATCAAGAATAACAGCAACATCCAATGC...GAAAAACTATGGGGACATTATTAACCCTGCTGATGGAATCTGTTTGACC', 'Botryllus_schlosseri_TRINITY_DN1013_c0_g2::TRINITY_DN1013_c0_g2_i1::g.248881::m.248881': 'CTGATTCTTCTCATCGGCTCTCTCTGCTTCACTCTGACCCACGGCCTT...TCTGAAGGCCTACGCGATTGCCAAGACGGCCAAAGACTCTTACAACTGG', 'Botryllus_schlosseri_TRINITY_DN10166_c0_g2::TRINITY_DN10166_c0_g2_i1::g.27418::m.27418': 'GTACATTTGAAGAAAATGTCTGAATTGATCCATCACGAAAAAGCTTTC...GGTCATCTCGAACAAAATGCACAGAACCATCGTGATCAGAAGAGACTAC', 'Botryllus_schlosseri_TRINITY_DN1017_c0_g1::TRINITY_DN1017_c0_g1_i1::g.248877::m.248877': 'TGCCATCTTCCGGCACTCATGGCCAAGATTGACGAGGTGCACCACTCG...CCAAGTTAAGACCTTGCCCACTGAGAAGCATACCCAGCCGGAGTGCTAG', 'Botryllus_schlosseri_TRINITY_DN101_c0_g1::TRINITY_DN101_c0_g1_i1::g.75736::m.75736': 'GAGCGATATCACGCTCGCTCGTTCAGCAAGCGCGAACGGCTGTATCGG...GAACCGGACCTTCGTCGATCTACCTGAGTTCATAGATGAAATGGATCCG', 'Botryllus_schlosseri_TRINITY_DN10203_c0_g1::TRINITY_DN10203_c0_g1_i1::g.234676::m.234676': 'GAAGGGCGGTTCGTGCCGCACCTGGTGGTCCGCATCGACGCCGGCACG...GGTCAACACAGATAAGGTATCGGCTACCACGGCCATCGAGGTCTTGCGC', 'Botryllus_schlosseri_TRINITY_DN10220_c0_g2::TRINITY_DN10220_c0_g2_i1::g.234678::m.234678': 'GATCATCGGTATGGACTACCCGTGGAAATCGAGATGACTACAATGGAC...TCGAGTGGTGCTAAATGCTGCTACTGATACATTTCGTGGTATACTAGAT', 'Botryllus_schlosseri_TRINITY_DN10249_c0_g1::TRINITY_DN10249_c0_g1_i1::g.234674::m.234674': 'GAAGGACTCTCGTTGATCGAACGCCGCAACTGCATGGAGCTGTTCCGA...GGATCGTCTCGACAAGGTTTACGCCAGGCACAGCGACATGGTCCTGTTG', ...}, '/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862', 'codeml', False, 1, 100, 'fasttree', 'muscle', '/groups/pupko/ashermoshe/dorothee/wgd_ksd'), {})]
132
133 def __len__(self):
134 return self._size
135
...........................................................................
/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/joblib/parallel.py in <listcomp>(.0=<list_iterator object>)
126 def __init__(self, iterator_slice):
127 self.items = list(iterator_slice)
128 self._size = len(self.items)
129
130 def __call__(self):
--> 131 return [func(*args, **kwargs) for func, args, kwargs in self.items]
func = <function analyse_family>
args = ('GF_000012', {'Botryllus_schlosseri_TRINITY_DN47590_c0_g1::TRINITY_DN47590_c0_g1_i1::g.253656::m.253656': 'MGETKSTITNLYGHNNRAKRRTQLRELAIRTRTEKSHLIAGDFNGIDS...ARRQKDQLRSDEKEMQKVRNRVIQNQQYLSDYRKIKKKIWKEKAESLHK', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g1::TRINITY_DN50048_c0_g1_i1::g.248848::m.248848': 'GAVISIDFLKAYDSVDHSFLHNTLEEAGFGVKVRAFFKAIYQGGSAKV...SGMKGKIATPSYADDVTITLAKEEESTKALQIVAEFGKASGLQINRKKT', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g2::TRINITY_DN50048_c0_g2_i1::g.248849::m.248849': 'QKTTSQFARGIIKTIFKKGDKEDIRNYRPITILNVDYKIISKVITNRIQKVLPTITHRHQFINPPNTIGDLNLLLREVTSDMRERSRGA', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g3::TRINITY_DN50048_c0_g3_i1::g.248850::m.248850': 'AGDFNGIDDIELDRDPVNIRHDAADAKYSKRVMEVIGVTDAFRQVHGS...CNHMPCPFSDHGATTALVKLTDHRPRRPNTWKNNTKVYEMEAFETELEV', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g4::TRINITY_DN50048_c0_g4_i1::g.248851::m.248851': 'KTWADKATDLGKLQLEAKADAEEALGKRPHLLCEKIKVRRDAVSITAIKDATGKTTEDPEEIRETVEEFYQKLYSKRETDKCTANSFHRYQDAKLSTR', 'Botryllus_schlosseri_TRINITY_DN50200_c0_g1::TRINITY_DN50200_c0_g1_i1::g.61823::m.61823': 'GEDGLSSELYMVNLDLMKKELTEVYNEIYEAQGTTTSLGRAVLKIIHK...KNYRPISLLNSDYKILSKILTNRLKQALPSITHQHQHVNPPKTIGQINL', 'Botryllus_schlosseri_TRINITY_DN52423_c0_g2::TRINITY_DN52423_c0_g2_i1::g.262149::m.262149': 'GKANISIGGKLGGNIRLGRGIKQGDPISMLLFTMATDPLLQRLNHDLD...DVNITLAHQADVNEALKIIQDFEEASALKLNKNKSKGITYHPKPPPGSK', 'Botryllus_schlosseri_TRINITY_DN52423_c0_g3::TRINITY_DN52423_c0_g3_i1::g.262150::m.262150': 'PPPGSKNVLKWVQSMEVLGHVINRHPPNDHETWNGLANKAKDLMREIK...YVATLKKMPINTRRELETAVTELLFGKSMRPDYRKLIQRREAGGIGLVD', 'Botryllus_schlosseri_TRINITY_DN58171_c0_g1::TRINITY_DN58171_c0_g1_i1::g.196397::m.196397': 'LEYEIVEFLFGKGKRPEYKKLVQQEIAGGREVKDIPTITDIIFIKPAV...RTDHQLALTLGWLRERPINNSRPHTWQPRQHWAEMAKIMKEMEYKRDYI', 'Botryllus_schlosseri_TRINITY_DN61293_c1_g1::TRINITY_DN61293_c1_g1_i1::g.134659::m.134659': 'AAFVMDLDGKIEYEKIIQARESGGLELVDIPTMTDLAFVKPALRYLQR...MKRYGMRKINNAIPHVFQPLQHWQETEKTMRSLGRQQQDIKSKRRERYR', ...}, {'Botryllus_schlosseri_TRINITY_DN10080_c0_g1::TRINITY_DN10080_c0_g1_i1::g.308497::m.308497': 'CTTAATTGTGCTTCATTTTTTAGCTCTCAAATGGCTCGATCAACGAAA...CATGGCGGAAGGCACGAAGGCCGTCGCCAAGCTCGCTGCCAGCAAATAA', 'Botryllus_schlosseri_TRINITY_DN10087_c0_g1::TRINITY_DN10087_c0_g1_i1::g.308496::m.308496': 'ATGGCGGTGGTGACACTGTTCTCGGTGGGGCATTCTAGGGAGGTGCGG...CTACCTCGACCCTCGGCTAGACCAACCATGGCCCCGCGTCGCGAAGGCC', 'Botryllus_schlosseri_TRINITY_DN1013_c0_g1::TRINITY_DN1013_c0_g1_i1::g.248880::m.248880': 'TGGGCAGTCCTGACTGGCACTATCAAGAATAACAGCAACATCCAATGC...GAAAAACTATGGGGACATTATTAACCCTGCTGATGGAATCTGTTTGACC', 'Botryllus_schlosseri_TRINITY_DN1013_c0_g2::TRINITY_DN1013_c0_g2_i1::g.248881::m.248881': 'CTGATTCTTCTCATCGGCTCTCTCTGCTTCACTCTGACCCACGGCCTT...TCTGAAGGCCTACGCGATTGCCAAGACGGCCAAAGACTCTTACAACTGG', 'Botryllus_schlosseri_TRINITY_DN10166_c0_g2::TRINITY_DN10166_c0_g2_i1::g.27418::m.27418': 'GTACATTTGAAGAAAATGTCTGAATTGATCCATCACGAAAAAGCTTTC...GGTCATCTCGAACAAAATGCACAGAACCATCGTGATCAGAAGAGACTAC', 'Botryllus_schlosseri_TRINITY_DN1017_c0_g1::TRINITY_DN1017_c0_g1_i1::g.248877::m.248877': 'TGCCATCTTCCGGCACTCATGGCCAAGATTGACGAGGTGCACCACTCG...CCAAGTTAAGACCTTGCCCACTGAGAAGCATACCCAGCCGGAGTGCTAG', 'Botryllus_schlosseri_TRINITY_DN101_c0_g1::TRINITY_DN101_c0_g1_i1::g.75736::m.75736': 'GAGCGATATCACGCTCGCTCGTTCAGCAAGCGCGAACGGCTGTATCGG...GAACCGGACCTTCGTCGATCTACCTGAGTTCATAGATGAAATGGATCCG', 'Botryllus_schlosseri_TRINITY_DN10203_c0_g1::TRINITY_DN10203_c0_g1_i1::g.234676::m.234676': 'GAAGGGCGGTTCGTGCCGCACCTGGTGGTCCGCATCGACGCCGGCACG...GGTCAACACAGATAAGGTATCGGCTACCACGGCCATCGAGGTCTTGCGC', 'Botryllus_schlosseri_TRINITY_DN10220_c0_g2::TRINITY_DN10220_c0_g2_i1::g.234678::m.234678': 'GATCATCGGTATGGACTACCCGTGGAAATCGAGATGACTACAATGGAC...TCGAGTGGTGCTAAATGCTGCTACTGATACATTTCGTGGTATACTAGAT', 'Botryllus_schlosseri_TRINITY_DN10249_c0_g1::TRINITY_DN10249_c0_g1_i1::g.234674::m.234674': 'GAAGGACTCTCGTTGATCGAACGCCGCAACTGCATGGAGCTGTTCCGA...GGATCGTCTCGACAAGGTTTACGCCAGGCACAGCGACATGGTCCTGTTG', ...}, '/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862', 'codeml', False, 1, 100, 'fasttree', 'muscle', '/groups/pupko/ashermoshe/dorothee/wgd_ksd')
kwargs = {}
132
133 def __len__(self):
134 return self._size
135
...........................................................................
/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/wgd/ks_distribution.py in analyse_family(family_id='GF_000012', family={'Botryllus_schlosseri_TRINITY_DN47590_c0_g1::TRINITY_DN47590_c0_g1_i1::g.253656::m.253656': 'MGETKSTITNLYGHNNRAKRRTQLRELAIRTRTEKSHLIAGDFNGIDS...ARRQKDQLRSDEKEMQKVRNRVIQNQQYLSDYRKIKKKIWKEKAESLHK', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g1::TRINITY_DN50048_c0_g1_i1::g.248848::m.248848': 'GAVISIDFLKAYDSVDHSFLHNTLEEAGFGVKVRAFFKAIYQGGSAKV...SGMKGKIATPSYADDVTITLAKEEESTKALQIVAEFGKASGLQINRKKT', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g2::TRINITY_DN50048_c0_g2_i1::g.248849::m.248849': 'QKTTSQFARGIIKTIFKKGDKEDIRNYRPITILNVDYKIISKVITNRIQKVLPTITHRHQFINPPNTIGDLNLLLREVTSDMRERSRGA', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g3::TRINITY_DN50048_c0_g3_i1::g.248850::m.248850': 'AGDFNGIDDIELDRDPVNIRHDAADAKYSKRVMEVIGVTDAFRQVHGS...CNHMPCPFSDHGATTALVKLTDHRPRRPNTWKNNTKVYEMEAFETELEV', 'Botryllus_schlosseri_TRINITY_DN50048_c0_g4::TRINITY_DN50048_c0_g4_i1::g.248851::m.248851': 'KTWADKATDLGKLQLEAKADAEEALGKRPHLLCEKIKVRRDAVSITAIKDATGKTTEDPEEIRETVEEFYQKLYSKRETDKCTANSFHRYQDAKLSTR', 'Botryllus_schlosseri_TRINITY_DN50200_c0_g1::TRINITY_DN50200_c0_g1_i1::g.61823::m.61823': 'GEDGLSSELYMVNLDLMKKELTEVYNEIYEAQGTTTSLGRAVLKIIHK...KNYRPISLLNSDYKILSKILTNRLKQALPSITHQHQHVNPPKTIGQINL', 'Botryllus_schlosseri_TRINITY_DN52423_c0_g2::TRINITY_DN52423_c0_g2_i1::g.262149::m.262149': 'GKANISIGGKLGGNIRLGRGIKQGDPISMLLFTMATDPLLQRLNHDLD...DVNITLAHQADVNEALKIIQDFEEASALKLNKNKSKGITYHPKPPPGSK', 'Botryllus_schlosseri_TRINITY_DN52423_c0_g3::TRINITY_DN52423_c0_g3_i1::g.262150::m.262150': 'PPPGSKNVLKWVQSMEVLGHVINRHPPNDHETWNGLANKAKDLMREIK...YVATLKKMPINTRRELETAVTELLFGKSMRPDYRKLIQRREAGGIGLVD', 'Botryllus_schlosseri_TRINITY_DN58171_c0_g1::TRINITY_DN58171_c0_g1_i1::g.196397::m.196397': 'LEYEIVEFLFGKGKRPEYKKLVQQEIAGGREVKDIPTITDIIFIKPAV...RTDHQLALTLGWLRERPINNSRPHTWQPRQHWAEMAKIMKEMEYKRDYI', 'Botryllus_schlosseri_TRINITY_DN61293_c1_g1::TRINITY_DN61293_c1_g1_i1::g.134659::m.134659': 'AAFVMDLDGKIEYEKIIQARESGGLELVDIPTMTDLAFVKPALRYLQR...MKRYGMRKINNAIPHVFQPLQHWQETEKTMRSLGRQQQDIKSKRRERYR', ...}, nucleotide={'Botryllus_schlosseri_TRINITY_DN10080_c0_g1::TRINITY_DN10080_c0_g1_i1::g.308497::m.308497': 'CTTAATTGTGCTTCATTTTTTAGCTCTCAAATGGCTCGATCAACGAAA...CATGGCGGAAGGCACGAAGGCCGTCGCCAAGCTCGCTGCCAGCAAATAA', 'Botryllus_schlosseri_TRINITY_DN10087_c0_g1::TRINITY_DN10087_c0_g1_i1::g.308496::m.308496': 'ATGGCGGTGGTGACACTGTTCTCGGTGGGGCATTCTAGGGAGGTGCGG...CTACCTCGACCCTCGGCTAGACCAACCATGGCCCCGCGTCGCGAAGGCC', 'Botryllus_schlosseri_TRINITY_DN1013_c0_g1::TRINITY_DN1013_c0_g1_i1::g.248880::m.248880': 'TGGGCAGTCCTGACTGGCACTATCAAGAATAACAGCAACATCCAATGC...GAAAAACTATGGGGACATTATTAACCCTGCTGATGGAATCTGTTTGACC', 'Botryllus_schlosseri_TRINITY_DN1013_c0_g2::TRINITY_DN1013_c0_g2_i1::g.248881::m.248881': 'CTGATTCTTCTCATCGGCTCTCTCTGCTTCACTCTGACCCACGGCCTT...TCTGAAGGCCTACGCGATTGCCAAGACGGCCAAAGACTCTTACAACTGG', 'Botryllus_schlosseri_TRINITY_DN10166_c0_g2::TRINITY_DN10166_c0_g2_i1::g.27418::m.27418': 'GTACATTTGAAGAAAATGTCTGAATTGATCCATCACGAAAAAGCTTTC...GGTCATCTCGAACAAAATGCACAGAACCATCGTGATCAGAAGAGACTAC', 'Botryllus_schlosseri_TRINITY_DN1017_c0_g1::TRINITY_DN1017_c0_g1_i1::g.248877::m.248877': 'TGCCATCTTCCGGCACTCATGGCCAAGATTGACGAGGTGCACCACTCG...CCAAGTTAAGACCTTGCCCACTGAGAAGCATACCCAGCCGGAGTGCTAG', 'Botryllus_schlosseri_TRINITY_DN101_c0_g1::TRINITY_DN101_c0_g1_i1::g.75736::m.75736': 'GAGCGATATCACGCTCGCTCGTTCAGCAAGCGCGAACGGCTGTATCGG...GAACCGGACCTTCGTCGATCTACCTGAGTTCATAGATGAAATGGATCCG', 'Botryllus_schlosseri_TRINITY_DN10203_c0_g1::TRINITY_DN10203_c0_g1_i1::g.234676::m.234676': 'GAAGGGCGGTTCGTGCCGCACCTGGTGGTCCGCATCGACGCCGGCACG...GGTCAACACAGATAAGGTATCGGCTACCACGGCCATCGAGGTCTTGCGC', 'Botryllus_schlosseri_TRINITY_DN10220_c0_g2::TRINITY_DN10220_c0_g2_i1::g.234678::m.234678': 'GATCATCGGTATGGACTACCCGTGGAAATCGAGATGACTACAATGGAC...TCGAGTGGTGCTAAATGCTGCTACTGATACATTTCGTGGTATACTAGAT', 'Botryllus_schlosseri_TRINITY_DN10249_c0_g1::TRINITY_DN10249_c0_g1_i1::g.234674::m.234674': 'GAAGGACTCTCGTTGATCGAACGCCGCAACTGCATGGAGCTGTTCCGA...GGATCGTCTCGACAAGGTTTACGCCAGGCACAGCGACATGGTCCTGTTG', ...}, tmp='/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862', codeml=<wgd.codeml.Codeml object>, preserve=False, times=1, min_length=100, method='fasttree', aligner='muscle', output_dir='/groups/pupko/ashermoshe/dorothee/wgd_ksd')
298 logging.debug("Distance will be in Ks units!")
299 clustering, pairwise_distances, tree_path = _weighting(
300 results_dict, msa=msa_path_protein, method="alc")
301 else:
302 clustering, pairwise_distances, tree_path = _weighting(
--> 303 results_dict, msa=msa_path_protein, method=method)
results_dict = {'Ka': ...
[94 rows x 94 columns], 'Ks': ...
[94 rows x 94 columns], 'Omega': ...
[94 rows x 94 columns]}
msa_path_protein = '/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862/GF_000012.fasta.msa'
method = 'fasttree'
304 if clustering is not None:
305 out = _calculate_weighted_ks(
306 clustering, results_dict, pairwise_distances, family_id
307 )
...........................................................................
/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/wgd/ks_distribution.py in _weighting(pairwise_estimates={'Ka': ...
[94 rows x 94 columns], 'Ks': ...
[94 rows x 94 columns], 'Omega': ...
[94 rows x 94 columns]}, msa='/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862/GF_000012.fasta.msa', method='fasttree')
93 elif method == 'fasttree':
94 # FastTree tree construction
95 logging.debug('Constructing phylogenetic tree with FastTree')
96 tree_path = run_fasttree(msa)
97 clustering, pairwise_distances = phylogenetic_tree_to_cluster_format(
---> 98 tree_path, pairwise_estimates['Ks'])
tree_path = '/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862/GF_000012.fasta.msa.nw'
pairwise_estimates = {'Ka': ...
[94 rows x 94 columns], 'Ks': ...
[94 rows x 94 columns], 'Omega': ...
[94 rows x 94 columns]}
99
100 else:
101 # Average linkage clustering based on Ks
102 logging.debug('Performing average linkage clustering on Ks values.')
...........................................................................
/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/wgd/phy.py in phylogenetic_tree_to_cluster_format(tree='/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862/GF_000012.fasta.msa.nw', pairwise_estimates= ...
[94 rows x 94 columns])
118 (only the index is used)
119 :return: clustering data structure, pairwise distances dictionary
120 """
121 id_map = {
122 pairwise_estimates.index[i]: i for i in range(len(pairwise_estimates))}
--> 123 t = Tree(tree)
t = undefined
tree = '/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862/GF_000012.fasta.msa.nw'
124
125 # midpoint rooting
126 midpoint = t.get_midpoint_outgroup()
127 if not midpoint: # midpoint = None when their are only two leaves
...........................................................................
/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/ete3/coretype/tree.py in __init__(self=Tree node '' (-0x7ffff800877d65c4), newick='/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862/GF_000012.fasta.msa.nw', format=0, dist=None, support=None, name=None, quoted_node_names=False)
206
207 # Initialize tree
208 if newick is not None:
209 self._dist = 0.0
210 read_newick(newick, root_node = self, format=format,
--> 211 quoted_names=quoted_node_names)
quoted_node_names = False
212
213
214 def __nonzero__(self):
215 return True
...........................................................................
/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/ete3/parser/newick.py in read_newick(newick='/groups/pupko/ashermoshe/dorothee/ks_tmp.371cd0d9072862/GF_000012.fasta.msa.nw', root_node=Tree node '' (-0x7ffff800877d65c4), format=0, quoted_names=False)
244 nw = nw.strip()
245 if not nw.startswith('(') and nw.endswith(';'):
246 #return _read_node_data(nw[:-1], root_node, "single", matcher, format)
247 return _read_newick_from_string(nw, root_node, matcher, format, quoted_names)
248 elif not nw.startswith('(') or not nw.endswith(';'):
--> 249 raise NewickError('Unexisting tree file or Malformed newick tree structure.')
250 else:
251 return _read_newick_from_string(nw, root_node, matcher, format, quoted_names)
252
253 else:
NewickError: Unexisting tree file or Malformed newick tree structure.
You may want to check other newick loading flags like 'format' or 'quoted_node_names'.
___________________________________________________________________________
"""
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/share/apps/anaconda3-5.1.0/lib/python3.6/site-packages/joblib/parallel.py", line 699, in retrieve
self._output.extend(job.get(timeout=self.timeout))
File "/share/apps/anaconda3-5.1.0/lib/python3.6/multiprocessing/pool.py", line 644, in get
raise self._value
joblib.my_exceptions.TransportableException: TransportableException
___________________________________________________________________________
NewickError Wed Feb 27 18:10:41 2019
PID: 62472 Python 3.6.8: /share/apps/anaconda3-5.1.0/bin/python
...........................................................................