Dataset and replication package for the paper "Where is Your App Frustrating Users?" (ICSE 2022).
- Python 3.6
- Main libraries: requirements.txt
- Pre-trained BERT
- Path: pytorch_version/prev_trained_model/
The directory dataset is provided for convenient viewing of the data; it contains both the labeled and the unlabeled data.
The data directory used when running our code: pytorch_version/CLUEdatasets/cluener/
app: App name
text: Review sentence
senti: Sentence sentiment (negative: [-5, -1], positive: [1, 5])
label: The problematic feature phrase, the beginning position and the ending position
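As an illustration of the four fields above, the following minimal sketch models one record and checks it against the stated ranges. The names ReviewRecord and validate, the sample values, and the assumptions that positions are character offsets and that the ending position is inclusive are all hypothetical; check them against the files in the data directory before relying on them.

```python
from dataclasses import dataclass


@dataclass
class ReviewRecord:
    """Hypothetical in-memory view of one labeled review sentence."""
    app: str      # App name
    text: str     # Review sentence
    senti: int    # Sentence sentiment: [-5, -1] negative, [1, 5] positive
    phrase: str   # Problematic feature phrase
    start: int    # Beginning position of the phrase (ASSUMED character offset)
    end: int      # Ending position of the phrase (ASSUMED inclusive)


def validate(rec: ReviewRecord) -> bool:
    """Return True if the record is consistent with the field descriptions."""
    senti_ok = -5 <= rec.senti <= -1 or 1 <= rec.senti <= 5
    span_ok = 0 <= rec.start <= rec.end < len(rec.text)
    phrase_ok = span_ok and rec.text[rec.start:rec.end + 1] == rec.phrase
    return senti_ok and phrase_ok


# Made-up sample, not taken from the dataset.
sample = ReviewRecord(
    app="ExampleApp",
    text="the video call keeps dropping",
    senti=-2,
    phrase="video call",
    start=4,
    end=13,
)
print(validate(sample))
```

A check like this can be run over your own data before training to catch spans that do not line up with the sentence text.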
run_ner_crf.sh
You can change the configuration in run_ner_crf.sh, including learning_rate, per_gpu_train_batch_size, per_gpu_eval_batch_size, num_train_epochs, etc. The other important parameters are:
overwrite_output_dir -- whether to overwrite the output directory
do_train -- whether to train the model
do_eval -- whether to evaluate the model
do_predict -- whether to predict the results for new data
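The sketch below shows one way options like these are typically exposed by a Python entry point. It assumes the shell script forwards them as command-line flags, which this README does not state. The function build_parser and every default value are hypothetical placeholders, not the settings used in the paper.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the options named in this section (illustrative only)."""
    parser = argparse.ArgumentParser(description="Sketch of the options named above")
    # Numeric hyperparameters; the defaults are placeholders, not the paper's values.
    parser.add_argument("--learning_rate", type=float, default=3e-5)
    parser.add_argument("--per_gpu_train_batch_size", type=int, default=8)
    parser.add_argument("--per_gpu_eval_batch_size", type=int, default=8)
    parser.add_argument("--num_train_epochs", type=float, default=3.0)
    # Boolean switches: passing the flag enables the behaviour.
    for flag in ("overwrite_output_dir", "do_train", "do_eval", "do_predict"):
        parser.add_argument(f"--{flag}", action="store_true")
    return parser


# Example: train and evaluate, but do not predict on new data.
args = build_parser().parse_args(
    ["--do_train", "--do_eval", "--learning_rate", "5e-5", "--num_train_epochs", "4"]
)
print(args.do_train, args.do_predict, args.learning_rate)
```

Under this assumption, omitting a switch such as do_predict leaves that stage disabled, so a single script can cover training, evaluation, and prediction runs.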
This step should be run after obtaining the results of Problematic Feature Extraction.
The script needs to be revised to match the format of your output data; assign your data to the variable domain_docs.
preprocess/clustering.py
pytorch_version/outputs/cluener_output/bert/
sira's Issues
Hello, using transformers==4.0.0 produces various errors:
AttributeError: 'CNerTokenizer' object has no attribute 'vocab'
TypeError: __init__() got an unexpected keyword argument 'special_tokens_map_file'
TypeError: __init__() got an unexpected keyword argument 'tokenizer_file'
TypeError: __init__() got an unexpected keyword argument 'name_or_path'
These errors occur. Has the required transformers version changed?
Has your TensorFlow version changed?
AttributeError: module 'tensorflow.python.keras.utils.generic_utils' has no attribute 'populate_dict_with_module_objects'
I cannot run your clustering.py successfully.