Comments (6)
As an example, the magic table scraper stores its settings in allSettings.json. Other tools / scrapers store their settings in a database table called swvariables (thanks to scraperwiki.sqlite.save_var).
This is an attempt to make a single (decent) way of doing this so we don't keep re-inventing this stuff.
from scraperwiki-python.
Three interesting decisions I like the sound of:
- It's just a dictionary, you can access variables by key rather than as an argument to a function.
- It's the same in Python and JavaScript.
- It basically has nothing to do with SQLite.
Although that 3rd one does make me wonder why it's under the "scraperwiki.sql" namespace – the fact that it's based on SQL is an implementation detail. Perhaps it should be "scraperwiki.var"?
from scraperwiki-python.
I've added an issue in the Custard repo for the JavaScript half of this: sensiblecodeio/custard#400
from scraperwiki-python.
It's in the sql namespace because it stores the sqlite store. I don't think it's an implementation detail, I think it's an advertised feature of the interface.
On the issue of it living in .sql or not I'm persuadable.
It should be called .variable, not .var
from scraperwiki-python.
@pwaller argues, persuasively IMO, that in Python it should mostly look like property access rather than dictionary access:
scraperwiki.sql.variable.my_favourite_status = "begruddled"
This is so that people are discouraged, by the syntax, from doing programmatic things to the variables (like iterating over all the variables, if people want to do that, they should probably be storing things in a table).
I approve.
obviously in JavaScript land, it's all the same.
from scraperwiki-python.
For examples of bad code that results from the lack of this facility, see the Twitter search tool: http://github.com/scraperwiki/twitter-search-tool/
One thing in particular - saving an individual variable without danger of damaging others is very important. i.e. it internally doing an "update ... where" for just the variable.
There's an interesting question about when the variable gets gotton again - i.e. when it does a round trip from Javascript. I'm not sure what the answer is though - needs a bit of discussion.
I've made this a Trello card: https://trello.com/c/kWLodpHG/66-variable-saving-loading-for-tools-simpler
from scraperwiki-python.
Related Issues (20)
- Error on `atexit` event when using only pdftoxml util function HOT 3
- Fix test failure in Python 3.5 and up
- `unique_keys` doesn't work for buffered data
- scraperwiki does not support python version 3.10.7
- pdftoxml() docs inconsistent with code HOT 4
- pdftoxml in utils.py is not portable to Windows. HOT 3
- utils.py uses both urllib2 and requests to make requests HOT 3
- cannot `import scraperwiki` on read only directory HOT 3
- doubleflush bug HOT 11
- No new release since July HOT 2
- INSERT OR REPLACE INTO is sqlite-specific HOT 10
- Cannot change the columns after the first save. HOT 5
- Can't save lxml strings. HOT 6
- mysterious "no such table" crash HOT 4
- SQLAlchemy error storing empty strings HOT 2
- Not Python 3 compatible HOT 6
- Unicode column names fail the second time you use them.
- sqlalchemy.exc.OperationalError on writing to a table `details` HOT 5
- Dependency on pdftohtml HOT 2
- sql.save has weird functionality with existing data
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scraperwiki-python.