ldbc / data-sets-surf-repository

Home Page: https://ldbcouncil.org/data-sets-surf-repository/

Shell 100.00%

data-sets-surf-repository's Issues

Are Social Network Benchmark (SNB) Interactive datasets SF 3000 and SF10000 available?

Hi. We found this work by accident and appreciate you making these generated datasets publicly available. We need large-scale temporal synthetic graphs to evaluate our system for dynamic graphs. The largest dataset we found in the repo is SF1000, but we would like a larger graph. The SF3000 and SF10000 datasets are mentioned in the paper, but I could not find links to them in this repo. Is there a download link for these two datasets? Thanks!

Server Error when attempting to download datasets

Using curl, I receive a small, invalid file, and when I click the link in the README I receive a 500 error.

Unable to download file

Hi,
I am unable to download files or request them as I am not affiliated with any of the institutes mentioned on the login site on SURF. Please let me know how I can access the files.

Thank you

duplicate header values causing issues

Hi,

Certain Social Network Benchmark CSV files have duplicate header names, which can complicate parsing. I am looking for suggestions.

Given the following data:

Dataset: https://repository.surfsara.nl/datasets/cwi/snb/files/social_network-csv_basic/social_network-csv_basic-sf1.tar.zst
File: dynamic/person_knows_person_0_0.csv (there are multiple other cases like this)

Person.id|Person.id|creationDate
933|4139|2010-03-13T07:37:21.718+0000
...

We run this snippet:

import csv
filepath = 'social_network-csv_basic-sf1/dynamic/person_knows_person_0_0.csv'
with open(filepath, "r") as f:
    data = csv.DictReader(f, delimiter="|")
    for d in data:
        print(d)

And observe the following:

{'Person.id': '4139', 'creationDate': '2010-03-13T07:37:21.718+0000'}

Notice how Person.id 933 no longer exists.

Looking for suggestions.
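
One possible workaround, sketched here rather than taken from the repository or its maintainers: csv.DictReader keys each row by header name, so the second Person.id column overwrites the first. Reading the header row with csv.reader and suffixing repeated names keeps both columns. The helper names unique_headers and read_rows and the ".1" suffix scheme are assumptions made for illustration.

```python
import csv
import io


def unique_headers(headers):
    """Suffix repeated header names so each column keeps a distinct key."""
    seen = {}
    result = []
    for name in headers:
        count = seen.get(name, 0)
        seen[name] = count + 1
        # First occurrence keeps its name; later ones get ".1", ".2", ...
        result.append(name if count == 0 else f"{name}.{count}")
    return result


def read_rows(f, delimiter="|"):
    """Yield one dict per data row, keyed by de-duplicated header names."""
    reader = csv.reader(f, delimiter=delimiter)
    header_row = next(reader, None)
    if header_row is None:
        return  # empty input: nothing to yield
    headers = unique_headers(header_row)
    for row in reader:
        yield dict(zip(headers, row))


# Two-line sample copied from the report above.
sample = (
    "Person.id|Person.id|creationDate\n"
    "933|4139|2010-03-13T07:37:21.718+0000\n"
)
rows = list(read_rows(io.StringIO(sample)))
print(rows[0])
```

With a real file, the same function can be called as read_rows(open(filepath, newline="")). Another option is to pass explicit names through the fieldnames argument of csv.DictReader and skip the header line manually, since DictReader then treats the first row as data.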

Datasets with different date formatter have different contents

% wc -l social_network-csv_basic-sf0.1/dynamic/person_knows_person_0_0.csv
   14074 social_network-csv_basic-sf0.1/dynamic/person_knows_person_0_0.csv
% wc -l social_network-csv_basic-longdateformatter-sf0.1/dynamic/person_knows_person_0_0.csv
   18075 social_network-csv_basic-longdateformatter-sf0.1/dynamic/person_knows_person_0_0.csv
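
The two counts above differ by 4,001 lines. A rough way to find which rows account for the gap is to compare the (Person.id, Person.id) pairs in both files while ignoring the date column, whose format differs between the two variants. The sketch below assumes that the first two columns hold the person ids and that each file starts with a header row; the function names and the in-memory sample rows are made up for illustration and are not taken from the datasets.

```python
import csv
import io


def edge_pairs(f, delimiter="|"):
    """Return the set of (first id, second id) pairs, skipping the header row."""
    reader = csv.reader(f, delimiter=delimiter)
    next(reader, None)  # skip the header row if there is one
    return {(row[0], row[1]) for row in reader if len(row) >= 2}


def compare_edges(f_a, f_b):
    """Return (pairs only in the first file, pairs only in the second file)."""
    a = edge_pairs(f_a)
    b = edge_pairs(f_b)
    return a - b, b - a


# Hypothetical sample rows; the date values are placeholders in two formats.
basic = io.StringIO(
    "Person.id|Person.id|creationDate\n"
    "1|2|2010-03-13T07:37:21.718+0000\n"
)
longfmt = io.StringIO(
    "Person.id|Person.id|creationDate\n"
    "1|2|1268465841718\n"
    "3|4|1268465841719\n"
)
only_basic, only_long = compare_edges(basic, longfmt)
print(len(only_basic), len(only_long))
```

For the real files, open each person_knows_person_0_0.csv with open(path, newline="") and pass both handles to compare_edges.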

Data set for sf1000

It looks like the link for the data set with scale factor 1000 is not working:
https://repository.surfsara.nl/datasets/cwi/snb/files/social_network-csv_basic/social_network-csv_basic-sf1000.tar.zst

Could you help fix? cc @szarnyasg

data-sets-surf-repository's Issues

Are Social Network Benchmark (SNB) Interactive datasets SF 3000 and SF10000 available?

Server Error when attempting to download datasets

Unable to download file

duplicate header values causing issues

Missing IS (short read) Substitution Parameters

Direct download links are not working

Datasets with different date formatter have different contents

Data set for sf1000
