Comments (3)
could you try loading bert from your local python interpreter and check if it's downloading BERT.
from rat-sql.
Hi, @DevanshChoubey
yes, I have checked that and it is working. The command I have used is
python -c "from transformers import BertModel; BertModel.from_pretrained('bert-large-uncased-whole-word-masking')"
Command-line output:
Downloading: 100%|█████████████████████████████████████████████████████████████████████████| 434/434 [00:00<00:00, 433kB/s]
Downloading: 100%|████████████████████████████████████████████████████████████████████| 1.35G/1.35G [01:37<00:00, 13.8MB/s]
from rat-sql.
I don't know How but changing it little work for me
shifted "# Cache the pretrained BERT model" after "# Download & cache StanfordNLP"
FROM pytorch/pytorch:1.5-cuda10.1-cudnn7-devel
ENV LC_ALL=C.UTF-8 \
LANG=C.UTF-8
RUN mkdir -p /usr/share/man/man1 && \
apt-get update && apt-get install -y \
build-essential \
cifs-utils \
curl \
default-jdk \
dialog \
dos2unix \
git \
sudo
# Install app requirements first to avoid invalidating the cache
COPY requirements.txt setup.py /app/
WORKDIR /app
RUN pip install --user -r requirements.txt --no-warn-script-location && \
pip install --user entmax && \
python -c "import nltk; nltk.download('stopwords'); nltk.download('punkt')"
# Cache the pretrained BERT model
#RUN python -c "from transformers import BertModel; BertModel.from_pretrained('bert-large-uncased-whole-word-masking')"
# Download & cache StanfordNLP
RUN mkdir -p /app/third_party && \
cd /app/third_party && \
curl https://nlp.stanford.edu/software/stanford-corenlp-full-2018-10-05.zip | jar xv
# Cache the pretrained BERT model
RUN python -c "from transformers import BertModel; BertModel.from_pretrained('bert-large-uncased-whole-word-masking')"
# Now copy the rest of the app
COPY . /app/
# Assume that the datasets will be mounted as a volume into /mnt/data on startup.
# Symlink the data subdirectory to that volume.
ENV CACHE_DIR=/mnt/data
RUN mkdir -p /mnt/data && \
mkdir -p /app/data && \
cd /app/data && \
ln -snf /mnt/data/spider spider && \
ln -snf /mnt/data/wikisql wikisql
# Convert all shell scripts to Unix line endings, if any
RUN /bin/bash -c 'if compgen -G "/app/**/*.sh" > /dev/null; then dos2unix /app/**/*.sh; fi'
# Extend PYTHONPATH to load WikiSQL dependencies
ENV PYTHONPATH="/app/third_party/wikisql/:${PYTHONPATH}"
ENTRYPOINT bash
from rat-sql.
Related Issues (20)
- How did you generate the heatmap plots for the attentions? HOT 2
- Issue during preprocessing: 'CoreNLP' object has no attribute 'client' HOT 6
- Do you have notebook version for RATSQL ? HOT 5
- Can`t train the model with GPU on a server with RTX3090 HOT 1
- Execution accuracy
- An error in preprocess HOT 3
- preporcessing issue HOT 2
- 0% accuracy during evaluation from trained model. - GLOVE HOT 1
- Decoder vocabulary file not created/found HOT 1
- How to find the 'terminal' value in the questions HOT 1
- The process is killed every 280 steps HOT 2
- Schema Modeling HOT 3
- How to execute my own queries in BERT
- Failed to generate val.jsonl after the preprocess of BERT-version rat-sql
- Trained model
- System Requirements
- 这个模型代码,可以应用与中文的数据么
- Colab Implementation of RatSQL and SPIDER dataset HOT 1
- How to customize for other dataset?
- 如何将数据库列名与给定的query对齐
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rat-sql.