Giter VIP home page Giter VIP logo

bmecat2goodrelations's Introduction

Prepare Your Business for the Future of Commerce! - Create GoodRelations Product Data from BMEcat XML Catalogs

About

This portable command line tool aims at facilitating the generation of GoodRelations documents to large or medium-sized enterprises that already support the BMEcat format (e.g. via PIM systems). The resulting output is intended for a quick and efficient publication of highly structured offers on a Web scale.

Features

The converter script consumes a BMEcat 2005 document (also downwards compatible with BMEcat 1.2) and creates a bunch of small documents including GoodRelations data. The script creates

  • a sitemap file,
  • a gzipped dump file (N-Triples) incorporating the whole data set,
  • an offer/model file for each single offering,
  • a data file including all defined product features, and
  • if available, a data file describing the full proprietary catalog structure and an HTML representation thereof.

The prerequisites to run the script are:

Other features:

  • Permissiveness and completeness. The converter is tolerant with regard to unexpected input, i.e. it does not break on corrupted files with elements that appear at positions other than intended for. There are basically two versions of BMEcat in circulation, BMEcat 1.2 and BMEcat 2005. The differences between the two releases are manageable, most updates concern enhancements of the standard. Probably the most notable change was the renaming of any occurrence of ARTICLE to PRODUCT. The converter is able to deal with both versions and accepts even a mixture of them. Furthermore, BMEcat2005 introduced multi-language support which is also regarded appropriately by our tool.
  • Extensibility. Provided a group identifier and a base URI for a specific classification standard it is possible to establish the linkage to externally defined classes and properties. In the current version, this functionality is already built-in for eCl@ss 5.1.4, using its RDF representation provided by eClassOWL.
  • User-friendliness and standards-compliance. The converter script is invoked from command-line, where it provides usage instructions. The tool affords an opportunity to select whether to reveal the transactional data (in particular, details like price information or delivery options) or to keep them private. If the latter option is chosen the tool will only produce product model data instead of complete offering descriptions.

In addition to that, the HTML output that is produced in conjunction with the catalog group system serves as a navigable document to discover the class hierarchy the product models belong to. If published online together with all other files, the human-readable document allows to dereference the categories of the product models that were assigned to them, and this way helps to explore the context and thus to disambiguate product models more easily.

Quickstart

The basic usage of the command line tool is as follows:

python main.py [options] FILE

You'll get a pretty nice summary of the possible options if you issue the following command:

python main.py --help

Publications

Alex Stolz, Bene Rodriguez-Castro, and Martin Hepp: Using BMEcat Catalogs as a Lever for Product Master Data on the Semantic Web. Proceedings of the 10th Extended Semantic Web Conference (ESWC 2013), (Montpellier, France, 2013). (Slides)

License and Source Code

The source code distributed via this Google repository is available under a free LGPL license.

Acknowledgements

BMEcat2GoodRelations is available under the terms of the GNU Lesser General Public License. The work on this project has been supported by the German Federal Ministry of Education and Research (BMBF) by a grant under the KMU Innovativ program as part of the Intelligent Match project (FKZ 01IS10022B).

http://www.bmbf.de/en/

bmecat2goodrelations's People

Contributors

alexstolz avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.