Giter VIP home page Giter VIP logo

data-sets-surf-repository's People

Contributors

szarnyasg avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

data-sets-surf-repository's Issues

Are Social Network Benchmark (SNB) Interactive datasets SF 3000 and SF10000 available?

Hi. We found this work by accident and appreciate you making these generated datasets publicly available. We are in need of large-scale temporal synthetic graphs to evaluate our system for dynamic graphs. The largest dataset we only found in the repo is SF1000, but we would like to have a larger graph. The SF3000 and SF10000 datasets are mentioned in the paper, but I could not find the links in this repo. Is there a download link for these two datasets? Thanks!

Unable to download file

Hi,
I am unable to download files or request them as I am not affiliated with any of the institutes mentioned on the login site on SURF. Please let me know how I can access the files.

Thank you

duplicate header values causing issues

Hi,

Certain Social Network Benchmark csv file have duplicate header names, which can complicate parsing. I am looking for suggestions.

Given the following data:

Person.id|Person.id|creationDate
933|4139|2010-03-13T07:37:21.718+0000
...

We run this snippet:

import csv
filepath = 'social_network-csv_basic-sf1/dynamic/person_knows_person_0_0.csv'
with open(filepath, "r") as f:
    data = csv.DictReader(f, delimiter="|")
    for d in data:
        print(d)

And observe the following:

{'Person.id': '4139', 'creationDate': '2010-03-13T07:37:21.718+0000'}

Notice how Person.id 933 no longer exists.

Looking for suggestions.

Missing IS (short read) Substitution Parameters

The Substitution parameters currently only contain the BI and IC parameters but not the Interactive Short query parameters. Are there any current suggestions for those when using the surf datasets?

Datasets with different date formatter have different contents

Datasets with the same scale factor and serializer but different date formatter have different contents.

For example, the csv_basic dataset with the string date formatter has 14074 knows edges but the csv_basic dataset with the long date formatter has 18075 knows edges:

% wc -l social_network-csv_basic-sf0.1/dynamic/person_knows_person_0_0.csv 
   14074 social_network-csv_basic-sf0.1/dynamic/person_knows_person_0_0.csv
% wc -l social_network-csv_basic-longdateformatter-sf0.1/dynamic/person_knows_person_0_0.csv
   18075 social_network-csv_basic-longdateformatter-sf0.1/dynamic/person_knows_person_0_0.csv

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.