Comments (4)
Hi,
Thank you for your interest in Code2vec.
I'm just letting you know that the encoder of code2seq is much better than code2vec's and does not use hashing. Its implementation, and the way to extend it to other languages, are very similar.
Regarding your questions:
- The hash function was simply Java's String#hashCode().
- No, there is no need to unhash.
- I am guessing that the arrows contribute only a few additional points. You can definitely drop them as a first step. In our PLDI'18 paper, the results without arrows were very similar (for other tasks and another language, though).
Best,
Uri
from code2vec.
Yes! It is only intended to shorten the long path strings into shorter strings.
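For anyone extending the extractor to another language, here is a minimal sketch of how Java's String#hashCode() could be mimicked in Python, so that a long path string is shortened the same way. The function name `java_string_hashcode` and the example path string are hypothetical and not taken from the code2vec code base. It assumes the documented Java definition (h = 31*h + c over UTF-16 code units, with 32-bit signed overflow).

```python
def java_string_hashcode(s: str) -> int:
    """Mimic Java's String#hashCode() for a Python string (illustrative sketch)."""
    h = 0
    # Java strings are sequences of UTF-16 code units, so hash those
    # rather than Python code points (they differ outside the BMP).
    data = s.encode("utf-16-be")
    for i in range(0, len(data), 2):
        unit = (data[i] << 8) | data[i + 1]
        # Keep only the low 32 bits to emulate Java int overflow.
        h = (31 * h + unit) & 0xFFFFFFFF
    # Reinterpret the unsigned 32-bit value as a signed Java int.
    return h - 0x100000000 if h >= 0x80000000 else h


if __name__ == "__main__":
    # Hypothetical path string, made up for illustration only.
    path = "(NameExpr)^(MethodCallExpr)_(NameExpr)"
    print(path, "->", java_string_hashcode(path))
    print(java_string_hashcode("hello"))  # 99162322, same as "hello".hashCode() in Java
```

Since the hash only shortens the path strings, matching Java exactly probably matters only if you want your output to line up with data that was already preprocessed with the Java extractor.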
Thank you for your prompt reply!
So ideally I should be okay with using any hash function then?