Giter VIP home page Giter VIP logo

Comments (14)

howie6879 avatar howie6879 commented on July 1, 2024

你好 欢迎贡献代码:

from owllook.

kslr avatar kslr commented on July 1, 2024

@howie6879
目前看起来是目录页直接返回HTML打印,这导致了结构不可控制。
比如“赤色黎明”返回的content

<table bgcolor="#E4E4E4" cellpadding="0" cellspacing="1" id="at">
<tbody>
<tr>
<td class="L"><a href="15117707.html">第一章</a></td>
<td class="L"><a href="15117710.html">第二章</a></td>
<td class="L"><a href="15117712.html">第三章</a></td>
<td class="L"><a href="15117715.html">第四章</a></td>
</tr>
more tr...
</tbody>
</table>

看起来只能解析标题列表然后在渲染。

from owllook.

howie6879 avatar howie6879 commented on July 1, 2024

对的 结构不可控

from owllook.

kslr avatar kslr commented on July 1, 2024

@howie6879
我看了下要改不少东西,估计只能针对每个站做采集规则。这边你是怎么考虑的呢?

from owllook.

howie6879 avatar howie6879 commented on July 1, 2024

没必要吧 我目前对每个选择器都写了样式。你如果要改就改这些样式就好 我只是没对移动端做详细适配 其实只是对这些选择器写一些移动适配就好。没那么复杂

from owllook.

serical avatar serical commented on July 1, 2024

取章节这里,我是取选择器下面的所有a链接,统一格式化成一种风格,这样移动端就一样了

from owllook.

howie6879 avatar howie6879 commented on July 1, 2024

选择器下面的a标签不一定都是目录

from owllook.

serical avatar serical commented on July 1, 2024

是的 ,需要过滤一些。
大体分三种:
一:dl dt dd
二:ul li
三:table
第一种章节目录可以完美搞过来,第二、三种我只要了正文章节,有的小标题没要了

from owllook.

howie6879 avatar howie6879 commented on July 1, 2024

很早前就写了不用选择器直接提取章节的一个接口 没切过来用

from owllook.

kslr avatar kslr commented on July 1, 2024

@howie6879 @serical
我的想法的话还是要做数据清洗,为了支持提供给移动端的API等等。
但是不知道howie6879如何考虑这个项目发展的。

from owllook.

howie6879 avatar howie6879 commented on July 1, 2024

其实我已经做了api 我初始目的想做小说推荐系统 现在这样有版权问题的

from owllook.

howie6879 avatar howie6879 commented on July 1, 2024

对于项目你们有什么意见呢 都可以提

from owllook.

kslr avatar kslr commented on July 1, 2024

如果同时提供盲人阅读就不需要版权许可。
我只是想要一个简单的小说阅读器不需要登录注册什么并支持移动端网页。

from owllook.

howie6879 avatar howie6879 commented on July 1, 2024

为什么提供盲人阅读就无需版权呢?

from owllook.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.