
wmyblog's People

Contributors

taizihuang


wmyblog's Issues

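Comment ordering problem on the original blog

Hello, and thank you for building and maintaining this archive of Mr. Wang's blog.
I found a problem: Mr. Wang deletes unnecessary comments from the comment section, and the udn platform then renumbers the remaining ones. For example, if comment #66 is deleted, the original #67 automatically becomes #66. The scraper does not account for this: it keeps the original #66, so the new #66 is lost. Could this be improved?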
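
One possible approach, sketched below, is to identify each comment by a fingerprint of its content instead of its position number, so that renumbering on udn no longer causes collisions. This is only an illustration: the field names (`floor`, `author`, `posted_at`, `text`) and the `deleted_upstream` flag are assumptions, not the scraper's actual data model.

```python
import hashlib

def fingerprint(comment):
    """Stable identity for a comment that does not depend on its position number."""
    raw = "\x1f".join([comment["author"], comment["posted_at"], comment["text"]])
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()

def merge_comments(archived, fetched):
    """Merge a fresh scrape into the archive, keyed by fingerprint rather than floor."""
    merged = {fingerprint(c): dict(c) for c in archived}
    live = set()
    for c in fetched:
        key = fingerprint(c)
        live.add(key)
        # Either a brand-new comment, or a known comment whose number shifted.
        merged[key] = dict(c)
    for key, c in merged.items():
        # Comments missing from the fresh scrape are flagged, not overwritten.
        c["deleted_upstream"] = key not in live
    return sorted(merged.values(), key=lambda c: (c["posted_at"], c["floor"]))
```

With this keying, the old #66 and the comment that moved from #67 to #66 are stored as two separate records. Whether deleted comments should stay in the archive or be dropped is a separate decision for the maintainer.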

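Would you consider adding RSS for articles?

I took a look, and at the moment there is only an RSS feed for new replies, not one for the articles themselves. Would you consider adding one?

Also, I noticed that the new-reply feed has two files, rss.xml and rss_notify.xml. What exactly is the difference between them?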
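
For the article feed, a minimal RSS 2.0 generator using only the standard library might look like the sketch below. The input fields (`art_id`, `title`, `published`) and the URL layout `<site_url>/html/<art_id>.html` are assumptions made for illustration; the real site structure may differ.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from email.utils import format_datetime

def build_article_feed(articles, site_title, site_url):
    """Return an RSS 2.0 document (as a string) with one <item> per article."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = site_title
    ET.SubElement(channel, "link").text = site_url
    ET.SubElement(channel, "description").text = "Newly archived articles"
    for art in articles:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = art["title"]
        # Assumed URL layout; adjust to wherever the HTML files are served.
        link = f"{site_url}/html/{art['art_id']}.html"
        ET.SubElement(item, "link").text = link
        ET.SubElement(item, "guid").text = link
        # RSS expects RFC 822 style dates, which format_datetime produces.
        ET.SubElement(item, "pubDate").text = format_datetime(art["published"])
    return ET.tostring(rss, encoding="unicode")
```

The `published` value should be a timezone-aware `datetime`, for example `datetime(2023, 1, 2, tzinfo=timezone.utc)`.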

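Thanks for your hard work

Thank you for the hard work of compiling Wang Mengyuan's articles, interviews, and transcripts. Very impressive!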

with open('./html/'+art_id+'.html','w') as f -> with open("index.html", "w") as index:

I would like to use a file-search tool to search and analyze each article. While doing so, I found that naming the HTML files by art_id is not very friendly. Could each article's HTML file be named after the title used in the index instead?
If it is not too difficult, batch export of the HTML files to PDF would be ideal.
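
A title-based file name needs to avoid characters that file systems reject and to stay unique when two articles share a title. The helper below is a rough sketch of one way to do that; `title_to_filename` and its `max_len` parameter are hypothetical names, not part of the existing script.

```python
import re

def title_to_filename(title, art_id, max_len=80):
    """Build a searchable, filesystem-safe HTML file name from an article title."""
    # Replace characters that are invalid on common file systems.
    cleaned = re.sub(r'[\\/:*?"<>|\x00-\x1f]', "_", title).strip(" .")
    # Collapse runs of whitespace and cap the length.
    cleaned = re.sub(r"\s+", " ", cleaned)[:max_len].rstrip(" .")
    # Keep art_id as a suffix so identical titles do not overwrite each other.
    name = f"{cleaned}_{art_id}" if cleaned else art_id
    return name + ".html"
```

Assuming the script has the article title available in a variable such as `title`, the existing call could become `open('./html/' + title_to_filename(title, art_id), 'w')`.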

P.S. I recently joined a Wang Mengyuan fan group (the 七公 readers' group, currently 101 members). I don't know whether you have already joined.

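Some issues with the tagging feature

I tried tagging some comments and have a few suggestions:

  • The web page should display the corresponding ID; sometimes I cannot tell whether the copy succeeded.
  • The tag descriptions are rather vague. For example, what kind of text does “对策” (countermeasures) refer to, and are there examples?
  • Is “问答1000” (Q&A 1000) assigned by the author, meaning volunteers should not modify it?
  • For regions: when a comment mentions relations between two countries (trade, military, and so on), should both countries be tagged? That way almost every comment would carry a region tag. A clearer standard would help.

Also, have you considered pre-annotation? The current data is not enough to train a reasonably effective model, so we could first try some simple strategies for pre-annotation, such as keyword matching (a category like region is easy to match). After that we could try model-based tagging, with volunteers reviewing and correcting the results. I will set aside time this weekend to work on this. Is there a clean dataset available, so that I do not have to clean the XML again?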
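
The keyword-matching idea for region tags could start from something as simple as the sketch below. The keyword table and tag names here are made-up examples, not the project's actual tag set, and the output is meant as a suggestion for volunteers to correct.

```python
# Illustrative keyword table; the real tag set and keywords would come from the project.
REGION_KEYWORDS = {
    "US": ["美国", "华盛顿", "白宫"],
    "Russia": ["俄罗斯", "俄国", "莫斯科"],
    "Japan": ["日本", "东京"],
}

def pre_annotate_regions(text, keywords=REGION_KEYWORDS):
    """Return the sorted region tags whose keywords appear in the comment text."""
    return sorted(
        tag
        for tag, words in keywords.items()
        if any(word in text for word in words)
    )

print(pre_annotate_regions("白宫与莫斯科的谈判"))  # ['Russia', 'US']
```

Plain substring matching will over-tag in some cases (for example, a passing mention of a country), which is why a human review step remains part of the workflow described above.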

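Thanks

There is nowhere else to leave a message, so I can only say thanks here.

I have long wanted to back up Dr. Wang's blog, but my limited skills kept me from getting to it.

Then I came across your work, and I truly admire and thank you for it.

I used to be a programmer, though not in web development.
I enjoy discussion, but with Dr. Wang's worldview there is almost no one in mainstream society I can talk to. If you are willing, add me on WeChat for a chat: 15327522321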
