orapi's People

Contributors

tholzheim, wolfgangfahl

orapi's Issues

Fix loading of custom or backup over ConferenceCorpus

ConferenceCorpus is currently focused on loading predefined data sources. Because there are several backups of openresearch, we need to load an OR backup from a given OR wiki, such as orfixed, without corrupting the ConferenceCorpus (CC) database.

Validation phase example: check Homepage

Cases:

  • Homepage exists
  • Homepage exists but has wrong content ("wrong" still needs to be defined, e.g. acronym or title cannot be found)
  • Homepage does not exist
    • check internet archive at time of creation of record or start date of event
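The three cases above could be distinguished roughly as follows. This is only a sketch: `HomepageStatus`, `check_homepage` and the injectable `fetch` callable are hypothetical names, and "wrong content" is assumed here to mean that neither the acronym nor the title appears in the page text. The Internet Archive fallback is not implemented; a second `fetch` that queries an archived snapshot could be passed in for the "does not exist" case.

```python
from enum import Enum
from typing import Callable, Optional


class HomepageStatus(Enum):
    OK = "homepage exists"
    WRONG_CONTENT = "homepage exists but wrong content"
    MISSING = "homepage does not exist"


def check_homepage(
    url: str,
    acronym: str,
    title: str,
    fetch: Callable[[str], Optional[str]],
) -> HomepageStatus:
    """Classify a homepage; fetch returns the page text or None if unreachable."""
    text = fetch(url)
    if text is None:
        return HomepageStatus.MISSING
    haystack = text.lower()
    # Assumption: content counts as "wrong" only if neither acronym nor title is found
    if acronym.lower() in haystack or title.lower() in haystack:
        return HomepageStatus.OK
    return HomepageStatus.WRONG_CONTENT
```

Passing the fetcher as a parameter keeps the check testable without network access.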

Proper MenuBar

Switch to MenuBar Widget for

  • proper handling of basedUrls
  • highlighting of active page

iswc upload fails

  File "/home/th/.local/lib/python3.8/site-packages/corpus/smw/topic.py", line 112, in pushToWiki
    self.updateWikiText(overwrite=overwrite)
  File "/home/th/.local/lib/python3.8/site-packages/corpus/smw/topic.py", line 87, in updateWikiText
    self.wikiFile.updateTemplate(self.entity.templateName, wikiSonRecord, overwrite=overwrite)
  File "/home/th/.local/lib/python3.8/site-packages/wikifile/wikiFile.py", line 245, in updateTemplate
    WikiFile.update_arguments(template, args, overwrite, prettify)
  File "/home/th/.local/lib/python3.8/site-packages/wikifile/wikiFile.py", line 311, in update_arguments
    template.set_arg(key, value+postfix, preserve_spacing=False)
TypeError: unsupported operand type(s) for +: 'int' and 'str'

ISWC-Updated.ods
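The traceback suggests that `value` is an `int` (presumably a numeric spreadsheet cell) when it is concatenated with the string `postfix`. One possible remedy, sketched with a hypothetical helper `with_postfix` that is not part of wikifile, is to convert the value to text before appending:

```python
def with_postfix(value, postfix: str = "\n") -> str:
    """Convert any cell value to text before appending the postfix."""
    # str() avoids "unsupported operand type(s) for +: 'int' and 'str'"
    return str(value) + postfix


# A numeric cell no longer raises a TypeError
year_text = with_postfix(2008)
```

Whether the conversion belongs in wikifile or in the caller that builds the record is a design decision the traceback alone does not settle.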

Add CI

  • add CI
  • add server startup to test setup
  • setup wikidocker to test the upload

Excel support

This is a method for creating Excel-compatible output ... Could we implement that?

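import pandas as pd

# lod: list of dicts, one per event; eventSeriesAcronym names the output file
df = pd.DataFrame(lod)
# the context manager saves the file on exit (writer.save() was removed in pandas 2.0)
with pd.ExcelWriter(f'{eventSeriesAcronym}.xlsx', engine='xlsxwriter') as writer:
    df.to_excel(writer, index=False)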

Add CI tests

Test environment:

  • mock pushing a few event pages into the target wiki

Tests:

  • test csv download from source wiki
  • test mocked csv upload → test if changes are applied
    • add option to start orapi without user verification (for tests)

Add commandline interface to start orapi

required params:

  • wikiId: id of the wiki for which the api should be provided
  • wikiTextPath: path to the location of the wikiMarkupFiles to init CC
    • might switch to live extraction (directly query pages from the wiki)
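A minimal sketch of such a command line interface, using `argparse` from the standard library. The option names mirror the two required parameters above; `build_parser`, the program name and the example path are assumptions for illustration, and actually starting the server is left out.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build the (hypothetical) command line parser for starting orapi."""
    parser = argparse.ArgumentParser(prog="orapi", description="Start the orapi server")
    parser.add_argument(
        "--wikiId",
        required=True,
        help="id of the wiki for which the api should be provided",
    )
    parser.add_argument(
        "--wikiTextPath",
        required=True,
        help="path to the location of the wikiMarkupFiles to init CC",
    )
    return parser


# Example invocation with explicit arguments instead of sys.argv
args = build_parser().parse_args(["--wikiId", "orfixed", "--wikiTextPath", "/tmp/wikibackup"])
```

If live extraction replaces the file-based setup later, `--wikiTextPath` could become optional.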

wikidata synchronization support

Integration tests with docker

  • Is an integration of wikidocker into the CI useful/necessary?
    Personally, I think this should only be done if there is a problem in the interaction between the components. Otherwise each component should do its own reliability tests.

Publish progress as links to source/preview and target wiki

Currently, "Publish completed" is shown together with a JSON representation that is not sorted.

The list and the upload should be sorted by year / ordinal going from the oldest to the newest so that the upload is finished when the most recent event is uploaded.
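The requested ordering could look roughly like this. The record fields `year`, `ordinal` and `acronym` are assumed names, the sample records are illustrative only, and placing records with missing values last is likewise an assumption.

```python
def sort_events(events):
    """Order event records oldest first: by year, then by ordinal."""

    def key(event):
        year = event.get("year")
        ordinal = event.get("ordinal")
        # Assumption: records with a missing year or ordinal are placed last
        return (year is None, year or 0, ordinal is None, ordinal or 0)

    return sorted(events, key=key)


events = [
    {"acronym": "ISWC 2010", "year": 2010, "ordinal": 9},
    {"acronym": "ISWC 2002", "year": 2002, "ordinal": 1},
    {"acronym": "ISWC 2005", "year": 2005, "ordinal": 4},
]
ordered = sort_events(events)
```

Uploading in this order means the most recent event is the last one processed, which marks the end of the upload.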

Two to three links should be displayed:

  • source wiki
  • preview wiki (optional, but probably useful at present)
  • target wiki (important)
    There should also be a delta view option / display that gives a hint on how big the changes have been or allows inspecting them. For the target and preview wiki this can easily be done with a link to the history page of the wiki involved. See the original Y-Principle generator written in Java for a potential UI for this.

Add Metadata to excel exports

The Excel sheets should contain metadata about the properties used, e.g. a mapping between column names and the names of the corresponding properties in potential target datasets such as openresearch, wikidata, etc.

  • display of the metadata at the frontend?
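One way to represent such a mapping is a plain dictionary that is flattened into rows for a separate metadata sheet. Everything here is a sketch: `COLUMN_METADATA` and `metadata_rows` are hypothetical names, the column names and openresearch property names are assumed, and the wikidata ids shown are the general "short name" (P1813) and "title" (P1476) properties chosen for illustration.

```python
# Hypothetical mapping: spreadsheet column -> property name per target dataset
COLUMN_METADATA = {
    "acronym": {"openresearch": "Acronym", "wikidata": "P1813"},
    "title": {"openresearch": "Title", "wikidata": "P1476"},
}


def metadata_rows(column_metadata):
    """Flatten the mapping into rows suitable for a separate metadata sheet."""
    rows = []
    for column, targets in column_metadata.items():
        for dataset, prop in targets.items():
            rows.append({"column": column, "dataset": dataset, "property": prop})
    return rows


rows = metadata_rows(COLUMN_METADATA)
```

With pandas, these rows could be written to a second sheet of the same workbook via `pd.DataFrame(rows).to_excel(writer, sheet_name="metadata", index=False)`, and the same structure could feed a frontend display.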

abstract Tableediting

Enhancing, checking and spreadsheet editing should have a testable and usable API.
