Comments (5)

alvra commented on July 21, 2024

Thanks for the report. I'm not sure this is a bug in spotnet itself (as opposed to the NNTP post XML), but it seems fixable, so I'll have a look one of these days.

Feel free to report on any fixable errors in updating, even if they're not strictly spotnet errors. I'd like to be able to parse as many posts as possible!

from django-spotnet.

hagst commented on July 21, 2024

573d53b: Allow some extra posts that contain formally invalid xml to be parsed.
This fix works and I haven't experienced any side effects.
Thanks for fixing.

hagst commented on July 21, 2024

Spots with & between <Image> </Image> in the header without <![CDATA[ ]]> are skipped

Post has invalid XML data for header X-XML: not well-formed (invalid token): line 1, column 832 =&
message-id [email protected]
Date: 18 Nov 2012 18:05:26 GMT

Post has invalid XML data for header X-XML: not well-formed (invalid token): line 1, column 819 =&
message-id [email protected]
Date: 18 Nov 2012 16:37:03 GMT

Post has invalid XML data for header X-XML: not well-formed (invalid token): line 1, column 1402 =&
message-id [email protected]
Date: 13 Nov 2012 18:05:32 GMT

Spots with & between <Tag> </Tag> in the header without <![CDATA[ ]]> are skipped

Post has invalid XML data for header X-XML: not well-formed (invalid token): line 1, column 109 =&
message-id [email protected]
Date: 30 Oct 2012 13:22:11 GMT

Spots with & between <Website></Website> in the header without <![CDATA[ ]]> are skipped

Post has invalid XML data for header X-XML: not well-formed (invalid token): line 1, column 913 = &
message-id [email protected]
Date: 26 Oct 2012 09:30:29 GMT
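
The `&` cases above all fail for the same reason: a bare ampersand outside a CDATA section is not well-formed XML. Below is a minimal Python sketch of one possible pre-processing step. It is not taken from the actual django-spotnet fix; `escape_bare_ampersands` is a hypothetical helper name, and the sample XML in the usage note is made up for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Matches a complete CDATA section; the capturing group makes re.split keep it.
_CDATA = re.compile(r"(<!\[CDATA\[.*?\]\]>)", re.DOTALL)
# Matches '&' that does not start a predefined entity or a character reference.
_BARE_AMP = re.compile(r"&(?!(?:amp|lt|gt|quot|apos|#[0-9]+|#x[0-9A-Fa-f]+);)")


def escape_bare_ampersands(xml_text):
    """Escape stray '&' characters, leaving CDATA sections untouched."""
    parts = _CDATA.split(xml_text)
    # Even indices hold text outside CDATA; odd indices hold the CDATA sections.
    for i in range(0, len(parts), 2):
        parts[i] = _BARE_AMP.sub("&amp;", parts[i])
    return "".join(parts)
```

Usage, assuming the X-XML value has already been joined into one string:

```python
raw = "<Spotnet><Posting><Website>http://example.com/?a=1&b=2</Website></Posting></Spotnet>"
root = ET.fromstring(escape_bare_ampersands(raw))
website = root.find("Posting/Website").text
```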

Spots with between <Description> </Description> in the header are skipped
Post has invalid XML data for header X-XML: not well-formed (invalid token): line 1, column 203
message-id [email protected]
Date: 08 Nov 2012 11:34:10 GMT

Spot skipped because 2 X-XML lines were split over multiple lines
Post has invalid XML data for header X-XML: unclosed CDATA section: line 1, column 316
message-id [email protected]
Date: 17 Sep 2012 08:11:23 GMT
Fix: Add X-XML: to the split lines, or merge everything from one X-XML: up to the next X-XML:, resulting in 2 X-XML: lines.
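
The second option (merging everything between one X-XML: and the next) could look roughly like the sketch below. This is only an illustration under assumptions: `collect_xml_header` is a hypothetical helper, the input is assumed to be a list of raw header lines with line endings already stripped, and any line that does not look like `Name: value` is treated as a continuation of the previous header.

```python
import re

# A header line is assumed to look like "Name: value"; anything else is a continuation.
_HEADER = re.compile(r"^([A-Za-z0-9-]+):[ \t]+(.*)$")


def collect_xml_header(lines, name="X-XML"):
    """Join all values of the given header, including lines split off from it."""
    chunks = []
    current = None  # name of the header the current line belongs to
    for line in lines:
        match = _HEADER.match(line)
        if match:
            current = match.group(1)
            value = match.group(2)
        else:
            # No header name on this line: attach it to the previous header.
            value = line
        if current is not None and current.lower() == name.lower():
            chunks.append(value)
    return "".join(chunks)
```

A split-off line that happens to start with something like `Note: ` would be misread as a new header, so this heuristic is not safe for every post.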

alvra commented on July 21, 2024

Except for the one with the error in parsing the Description, these should now be fixed.

hagst commented on July 21, 2024

Test results after updating Spotnet to version 4e4fe6a
The spots mentioned above are now valid, including the spot with the error in parsing the Description.

Spot with in the header
Post has invalid XML data for header X-XML: not well-formed (invalid token): line 1, column 1853
messageid [email protected]
Date: Thu, 19 Jul 2012 18:22:00 GMT
header shows <Image> <![C�ATA[http://x.x.x.jpg]]> </Image>

Spot with % in the header
Post has invalid XML data for header X-XML: not well-formed (invalid token): line 1, column 64 =%
messageid [email protected]
Date: Fri, 13 Jul 2012 08:53:03 GMT
Header shows <Post%r>name</Poster> instead of <Poster>name</Poster>

Some spots with part of the header in the body
Header shows <Description> and no </Description>.
Result: incomplete description, missing <Website>, <Image>, <Category>, <NZB>, </Posting>, </Spotnet>
The missing part of the header appears at the start of the body.

Post has invalid XML data for header X-XML: unclosed CDATA section: line 1, column 188
messageid [email protected]
Date: Mon, 09 Jul 2012 21:10:33 +0200
Organization: Newsgrabber

Post has invalid XML data for header X-XML: unclosed CDATA section: line 1, column 197
messageid [email protected]
Organization: Newsgrabber
Date: Tue, 11 Sep 2012 22:28:26 +0200
Same spotter and same problem [email protected]

Post has invalid XML data for header X-XML: unclosed CDATA section: line 1, column 339
messageid [email protected]
Date: Wed, 01 Aug 2012 16:23:48 -0500
Spot placed using Supernews with SSL (known Supernews problem)
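
For the spots where the rest of the header ended up at the start of the body, one conceivable recovery is to keep appending leading body lines to the header XML until it parses. The sketch below is only an illustration of that idea, not something django-spotnet is known to do; `parse_with_body_fallback` and `max_lines` are hypothetical, and it assumes the spilled lines can be joined without a separator.

```python
import xml.etree.ElementTree as ET


def parse_with_body_fallback(header_xml, body_lines, max_lines=20):
    """Parse the header XML, borrowing leading body lines if it is truncated."""
    try:
        return ET.fromstring(header_xml)
    except ET.ParseError:
        pass  # truncated or otherwise invalid: try to complete it from the body
    candidate = header_xml
    for line in body_lines[:max_lines]:
        candidate += line
        try:
            return ET.fromstring(candidate)
        except ET.ParseError:
            continue  # still incomplete, append the next body line
    return None  # could not be repaired this way
```

This cannot help with the spots whose missing part is not in the body either; for those the function simply returns `None`.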

Some spots with incomplete header
Header shows <Description> and no </Description>.
Result: incomplete description, missing <Website>, <Image>, <Category>, <NZB>, </Posting>, </Spotnet>
The missing part of the header is also not listed in the body.

Post has invalid XML data for header X-XML: unclosed CDATA section: line 1, column 1766
messageid [email protected]
Date: Tue, 17 Jul 2012 13:00:16 +0200
Organization: Newsxs (Secured through NewsXS SSL)

Post has invalid XML data for header X-XML: unclosed CDATA section: line 1, column 1766
messageid [email protected]
Date: Tue, 17 Jul 2012 13:01:55 +0200
Organization: Newsxs (Secured through NewsXS SSL)
