Comments (14)
你好 欢迎贡献代码:
- 前端在这里生成:https://github.com/howie6879/owllook/blob/master/owllook/templates/novels/chapter.html
- 后端:134行 https://github.com/howie6879/owllook/blob/master/owllook/views/novels_blueprint.py
from owllook.
@howie6879
目前看起来是目录页直接返回HTML打印,这导致了结构不可控制。
比如“赤色黎明”返回的content
<table bgcolor="#E4E4E4" cellpadding="0" cellspacing="1" id="at">
<tbody>
<tr>
<td class="L"><a href="15117707.html">第一章</a></td>
<td class="L"><a href="15117710.html">第二章</a></td>
<td class="L"><a href="15117712.html">第三章</a></td>
<td class="L"><a href="15117715.html">第四章</a></td>
</tr>
more tr...
</tbody>
</table>
看起来只能解析标题列表然后在渲染。
from owllook.
对的 结构不可控
from owllook.
@howie6879
我看了下要改不少东西,估计只能针对每个站做采集规则。这边你是怎么考虑的呢?
from owllook.
没必要吧 我目前对每个选择器都写了样式。你如果要改就改这些样式就好 我只是没对移动端做详细适配 其实只是对这些选择器写一些移动适配就好。没那么复杂
from owllook.
取章节这里,我是取选择器下面的所有a链接,统一格式化成一种风格,这样移动端就一样了
from owllook.
选择器下面的a标签不一定都是目录
from owllook.
是的 ,需要过滤一些。
大体分三种:
一:dl dt dd
二:ul li
三:table
第一种章节目录可以完美搞过来,第二、三种我只要了正文章节,有的小标题没要了
from owllook.
很早前就写了不用选择器直接提取章节的一个接口 没切过来用
from owllook.
@howie6879 @serical
我的想法的话还是要做数据清洗,为了支持提供给移动端的API等等。
但是不知道howie6879如何考虑这个项目发展的。
from owllook.
其实我已经做了api 我初始目的想做小说推荐系统 现在这样有版权问题的
from owllook.
对于项目你们有什么意见呢 都可以提
from owllook.
如果同时提供盲人阅读就不需要版权许可。
我只是想要一个简单的小说阅读器不需要登录注册什么并支持移动端网页。
from owllook.
为什么提供盲人阅读就无需版权呢?
from owllook.
Related Issues (20)
- 爬起点的书名报错 HOT 3
- ModuleNotFoundError: No module named 'pymongo.mongo_replica_set_client' HOT 3
- 大佬,为啥现在https://owlook.com.cn/网站不行了,里面小说点进去具体章节都是404 HOT 4
- docker镜像构建 pipenv install 环节报错
- 没被破译的小说
- 你有点极端了
- 演示网站炸了
- 按照安装教程pipenv install 安装失败,小白一枚,麻烦看看 HOT 2
- 如何调试源码 HOT 3
- 利用docker,一直启动不起来,大佬们帮帮忙吧 HOT 1
- docker打包镜像出错 HOT 9
- 登录注册和阅读小说功能不能用 HOT 1
- 遇到和之前几位一样的MONGODB后缀问题 HOT 1
- 免费开源的小说阅读项目 https://github.com/yuenov/reader-api
- 修改config.py文件里面的redis配置不生效
- 渲染网页时遇到个错误 HOT 1
- 哪里可以关闭直接跳转第三方网站啊
- 提取章节标题,遇到有表情的章节 不能正确提取
- pipenv install --python 这一步一直报错是为什么
- 请求服务报错,TypeError: 'Request' object does not support item assignment HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from owllook.