Giter VIP home page Giter VIP logo

nmn's Introduction

NMN

Source code and datasets for ACL 2020 paper: Neighborhood Matching Network for Entity Alignment.

Datasets

Please first download the datasets here and extract them into data/ directory.

Initial datasets DBP15K and DWY100K are from JAPE and BootEA.

Take the dataset DBP15K (ZH-EN) as an example, the folder "zh_en" contains:

  • ent_ids_1: ids for entities in source KG;
  • ent_ids_2: ids for entities in target KG;
  • ref_ent_ids: entity links encoded by ids;
  • triples_1: relation triples encoded by ids in source KG (DBP_ZH);
  • triples_1_s: remaining relation triples encoded by ids in source KG (S-DBP_ZH);
  • triples_2: relation triples encoded by ids in target KG (DBP_EN);
  • triples_2_s: remaining relation triples encoded by ids in target KG (S-DBP_EN);
  • vectorList.json: the input entity feature matrix initialized by word vectors;

Environment

  • Python>=3.5
  • Tensorflow>=1.8.0
  • Scipy
  • Numpy

Due to the limited graphics memory of GPU, we ran our codes using CPUs (40 Intel(R) Xeon(R) CPU E5-2640 v4 @ 2.40GHz).

Running

For example, to run NMN on DBP15K (ZH-EN), use the following script:

python3 main.py --dataset DBP15k --lang zh_en

Due to the instability of embedding-based methods, it is acceptable that the results fluctuate a little bit (±1%) when running code repeatedly.

If you have any difficulty or question in running code and reproducing expriment results, please email to [email protected].

Citation

If you use this model or code, please cite it as follows:

Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang and Dongyan Zhao. Neighborhood Matching Network for Entity Alignment. In: ACL 2020.

nmn's People

Contributors

stephaniewyt avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

nmn's Issues

vectorList.json

Hi, there is no vectorList.json file in the zh_en folder of dbp15k

希望可以得到数据集的格式标准

您好,我正在构建一个新的跨语言知识图谱数据集,想使用NMN模型验证数据集的性能,但是对于模型需要的数据集格式不太清楚,如三元组中每个token所代表的意义。所以能否给予一份模型数据集所需格式的规范或对原始DBP15k数据集做预处理的代码?
谢谢!

process killed in dbp_yg dataset

Hi,

My process will be killed by itself when I run dataset dbp_yg. But there is no problem with the other datasets. Is there any parameters or other issues I ignored?

Best Regards.
1595229202

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.