Giter VIP home page Giter VIP logo

Comments (6)

kjamsh avatar kjamsh commented on September 15, 2024

Hi Licheng,

Thanks for reaching out! I'm having trouble reproducing the crash. convert_data seems to handle that input fine on my machine. Could you tell me more about the environment? What is your compiler version? Did you use the TBB library in the repo or one that's installed on your computer? Have you made any changes to the code? I'd love to get to the bottom of this :)

As for interpreting the results: although it's called frequent subgraph mining, the application returns the frequent patterns instead. So [2,0-4,0] doesn't refer to an edge in the data graph, but an edge in a pattern graph. What Peregrine is saying is that there are 3 vertex-induced patterns with 4 vertices whose matches in the data graph meet the support threshold of 4. The () pairs encode anti-edges. For example the anti-edge (1,0~4,0) in the first pattern means that no matches for pattern vertex 1 are connected to the matches for vertex 4. The paper contains more information on this.

Let me know if you have any other questions!

from peregrine.

Licheng-Guo avatar Licheng-Guo commented on September 15, 2024

Hi Kasra,

Thanks for your explanation!

For the convert_data issue, it seems that it will only crash on tiny graphs. I'm running Ubuntu 18.04 with g++-10 and I did not make any modification to the code.

Thanks for clarifying the result format! In my scenario I need to know all the subgraphs of each pattern, so I wonder if there is an easy way to dump the actual subgraphs for each pattern?

Thanks a lot for your help!

Best,
Licheng

from peregrine.

kjamsh avatar kjamsh commented on September 15, 2024

I'm happy to help! You have to make a couple modifications to the sample application in order to output subgraphs. I'd prefer we discuss that over email so we can keep the issue board focused. If you shoot me an email (the address is on my profile) with what kind of output you want and the the size of graphs you're working with I can suggest an appropriate method.

Coming back to the crashes, do you mind attaching the exact files you passed to convert_data?

from peregrine.

Licheng-Guo avatar Licheng-Guo commented on September 15, 2024

Sure, I attached two files. "sample2.txt" will work while "sample3.txt" will segfault.

sample2.txt
sample3.txt

from peregrine.

kjamsh avatar kjamsh commented on September 15, 2024

I just pushed a fix! Let me know if you have any other issues, or if this one is persisting.

from peregrine.

lady-bluecopper avatar lady-bluecopper commented on September 15, 2024

Hello! I also get a segmentation fault when trying to convert a big graph (Nodes=4589876, Edges=43968798). The files are huge and GitHub does not allow me to attach them, but I am uploading them here: https://www.dropbox.com/sh/0dyctovvq6z1uf0/AAALfHnYbn3fXQZ6AlEfyhWaa?dl=0
I run the command:
bin/convert_data data/peregrine_format/youtube_edges.txt data/peregrine_format/youtube_labels.txt data/youtube/
I hope you can help me!
Thanks,
Giulia

from peregrine.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.