Giter VIP home page Giter VIP logo

drrepair's Issues

about dataset

Hi, Dear Author @michiyasunaga, I successfully downloaded the datasets and tried to understand JSON's internal contents. If understand correctly, are the codes in X['lines'] are fixed codes?

I also found that each program (in a single json file) has a lot groups of buggy information by ["err_line", "err_msg", "mod_line", "mod_code"]. If my understanding is correct, when using the workable program in X['lines'], we can independently generate multiple buggy codes by each group of ["err_line", "err_msg", "mod_line", "mod_code"], is that a correct usage?

Unable to evaluate model (on macOS)

DrRepair/evaluation/deepfix/out/code-compiler--2l-graph [master]
% for entry in ${test_split_root}/*
do
  probid=`basename $entry`
  python3 -u ../../test_deepfix.py \
  --input-code-dir ${program_data_root}/${probid}/erroneous \
  --repairer-server  http://0.0.0.0:8002/pred
done

Traceback (most recent call last):
  File "../../test_deepfix.py", line 331, in <module>
    main()
  File "../../test_deepfix.py", line 325, in main
    stitch()
  File "../../test_deepfix.py", line 290, in stitch
    stitch_helper(prog_fname)
  File "../../test_deepfix.py", line 231, in stitch_helper
    _code_str_tokenized = ' '.join(tokenize_code(_code, mod_brace=False))
DrRepair/utils/code_process.py", line 44, in tokenize_code
    clang.cindex.Config.set_library_path('/usr/local/Cellar/llvm/12.0.0/lib/')
  File "/Library/Python/3.8/lib/python/site-packages/clang/cindex.py", line 4107, in set_library_path
    raise Exception("library path must be set before before using " \
Exception: library path must be set before before using any other functionalities in libclang.

list index out of bounds error

tok=ex.src_vocab[idx - len(self.vocab)] in file DrRepair/model/repairer/data/err_dataset.py line 181 throws list index out of bounds error. How should i go about resolving this issue? I'm using the preprocessed models.

What is the word "Vanilla" or "Substitute"?

Hello, Can you tell me what is the "Vanilla" or "Substitute"? I can see "SubstituteErrData" and "VanillaErrData".Why is the name chosen like this? What is their difference?Thanks.

clang error

./2.run-gen-err-dataset--auto-corrupt--spoc.sh 

mkdir: cannot create directory ‘err-data-compiler--auto-corrupt--additional-codeforce--spoc-style’: File exists
joblib.externals.loky.process_executor._RemoteTraceback: 
"""
Traceback (most recent call last):

.local/lib/python3.8/site-packages/clang/cindex.py", line 4178, in get_cindex_library
    raise LibclangError(msg)
clang.cindex.LibclangError: libclang-11.so: cannot open shared object file: No such file or directory. To provide $
 path to libclang use Config.set_library_path() or Config.set_library_file().
"""

This fixed it for me

pip install clang
pip install libclang

RuntimeError: "index_select_out_cuda_impl" not implemented for 'Float'

Dear Dr. michiyasunaga

Hello ,I have a error when I train the dataset,

`TRAIN: 3%|▎ | 4993/150000 [15:45<7:43:02, 5.22it/s]TRAIN @ 5000: (n=838, loss_localize=0.246899, loss_edit=6.836520, acc_localize=73.63%, acc_edit1=21.72%, acc_edit2=0.00%, acc_repair=0.00%, grad_norm=27.990015)
Saving model to checkpoint 5000
Loaded 2010 dev examples

DEV: 0it [00:00, ?it/s]�[A
DEV: 0it [00:00, ?it/s]
Traceback (most recent call last):
File "/workspace/DrRepair/model/main_spoc.py", line 75, in
main()
File "/workspace/DrRepair/model/main_spoc.py", line 65, in main
experiment.train()
File "/workspace/DrRepair/model/repairer/experiments.py", line 146, in train
stats = self.process_batch(dev_batch, train=False, fout=fout)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/DrRepair/model/repairer/experiments.py", line 224, in process_batch
logit_edit1, label_edit1 = self.model.forward_edit(batch, all_enc_stuff, train_mode=False, beam_size=10) #follow the edit_lineno given
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/DrRepair/model/repairer/model/err_localize_edit.py", line 777, in forward_edit
dec_output, padded_gold_code_line = self.forward_helper_decode(batch, packed_dec_input, src_vocabs, src_map, train_mode) #(max_seq_len, batch_size, vocab_size)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/DrRepair/model/repairer/model/err_localize_edit.py", line 851, in forward_helper_decode
allHyp, allScores = self.beam_decode(hidden, enc_output, mask, extra_feed, src_vocabs, src_map)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/DrRepair/model/repairer/model/err_localize_edit.py", line 971, in beam_decode
beam.advance(log_probs, attn)
File "/workspace/DrRepair/model/repairer/model/beam_search_onmt.py", line 303, in advance
[self.alive_seq.index_select(0, self.select_indices),
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
RuntimeError: "index_select_out_cuda_impl" not implemented for 'Float'

TRAIN: 3%|▎ | 5000/150000 [15:54<7:41:08, 5.24it/s]`

I've tried many ways to always get an error, maybe it's a problem with the torch/cuda version?

Thanks in advance~

0% trained data trained

I keep following the steps in the ReadMe but whatever I do after the training phase it always shows TRAIN: 0% |.

About the license

Hello, it seems there is no information about the license in this repository.
Can I use your codes and dataset?

about test

Dear Dr. Yasunaga
Hello ,Dear Author @michiyasunaga , I am reproducing your paper(Graph-based, Self-Supervised Program Repair from Diagnostic Feedback).
I follow the step of your github.When i evaluate the model(code-compiler--2l-graph), I found the amount of passed and failed is 1,260.It is different from the paper(6,971).Is this path(/data/err-data-compiler--auto-corrupt--orig-deepfix/bin4) the real path of test set?
How can i evaluate the Deepfix raw test?

I would be very grateful indeed for any help you could give me.

Best wishes

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.