I'm trying to run udify on some data and have followed the instructions, e.g.
fran@ipek:~/source/udify$ python3.8 predict.py --device -1 udify-model.tar.gz test.0.conllu.input logs/pred.0.conllu --eval_file logs/pred.0.json
2021-01-15 16:27:42,512 - INFO - allennlp.models.archival - loading archive file /home/fran/source/udify from cache at /home/fran/source/udify
2021-01-15 16:27:42,548 - INFO - allennlp.common.registrable - instantiating registered subclass udify_model of <class 'allennlp.models.model.Model'>
2021-01-15 16:27:42,548 - INFO - allennlp.common.params - vocabulary.type = default
2021-01-15 16:27:42,548 - INFO - allennlp.common.registrable - instantiating registered subclass default of <class 'allennlp.data.vocabulary.Vocabulary'>
2021-01-15 16:27:42,548 - INFO - allennlp.data.vocabulary - Loading token dictionary from /home/fran/source/udify/vocabulary.
2021-01-15 16:27:44,391 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.models.model.Model'> from params {'decoders': {'deps': {'arc_representation_dim': 768, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'pos_embed_dim': None, 'tag_representation_dim': 256, 'type': 'udify_dependency_decoder'}, 'feats': {'adaptive': True, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'feats', 'type': 'udify_tag_decoder'}, 'lemmas': {'adaptive': True, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'lemmas', 'type': 'udify_tag_decoder'}, 'upos': {'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'upos', 'type': 'udify_tag_decoder'}}, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'layer_dropout': 0.08, 'mix_embedding': 12, 'tasks': ['upos', 'feats', 'lemmas', 'deps'], 'text_field_embedder': {'allow_unmatched_keys': True, 'dropout': 0.4, 'embedder_to_indexer_map': {'bert': ['bert', 'bert-offsets']}, 'token_embedders': {'bert': {'bert_config': 'config/archive/bert-base-multilingual-cased/bert_config.json', 'combine_layers': 'all', 'dropout': 0.1, 'layer_dropout': 0.08, 'requires_grad': True, 'type': 'udify-bert-predictor'}}, 'type': 'udify_embedder'}, 'type': 'udify_model', 'word_dropout': 0.1} and extras {'vocab'}
2021-01-15 16:27:44,391 - INFO - allennlp.common.params - model.type = udify_model
2021-01-15 16:27:44,392 - INFO - allennlp.common.from_params - instantiating class <class 'udify.models.udify_model.UdifyModel'> from params {'decoders': {'deps': {'arc_representation_dim': 768, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'pos_embed_dim': None, 'tag_representation_dim': 256, 'type': 'udify_dependency_decoder'}, 'feats': {'adaptive': True, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'feats', 'type': 'udify_tag_decoder'}, 'lemmas': {'adaptive': True, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'lemmas', 'type': 'udify_tag_decoder'}, 'upos': {'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'upos', 'type': 'udify_tag_decoder'}}, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'layer_dropout': 0.08, 'mix_embedding': 12, 'tasks': ['upos', 'feats', 'lemmas', 'deps'], 'text_field_embedder': {'allow_unmatched_keys': True, 'dropout': 0.4, 'embedder_to_indexer_map': {'bert': ['bert', 'bert-offsets']}, 'token_embedders': {'bert': {'bert_config': 'config/archive/bert-base-multilingual-cased/bert_config.json', 'combine_layers': 'all', 'dropout': 0.1, 'layer_dropout': 0.08, 'requires_grad': True, 'type': 'udify-bert-predictor'}}, 'type': 'udify_embedder'}, 'word_dropout': 0.1} and extras {'vocab'}
2021-01-15 16:27:44,392 - INFO - allennlp.common.params - model.tasks = ['upos', 'feats', 'lemmas', 'deps']
2021-01-15 16:27:44,392 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.text_field_embedders.text_field_embedder.TextFieldEmbedder'> from params {'allow_unmatched_keys': True, 'dropout': 0.4, 'embedder_to_indexer_map': {'bert': ['bert', 'bert-offsets']}, 'token_embedders': {'bert': {'bert_config': 'config/archive/bert-base-multilingual-cased/bert_config.json', 'combine_layers': 'all', 'dropout': 0.1, 'layer_dropout': 0.08, 'requires_grad': True, 'type': 'udify-bert-predictor'}}, 'type': 'udify_embedder'} and extras {'vocab'}
2021-01-15 16:27:44,392 - INFO - allennlp.common.params - model.text_field_embedder.type = udify_embedder
2021-01-15 16:27:44,392 - INFO - allennlp.common.params - model.text_field_embedder.allow_unmatched_keys = True
2021-01-15 16:27:44,392 - INFO - allennlp.common.params - model.text_field_embedder.dropout = 0.4
2021-01-15 16:27:44,392 - INFO - allennlp.common.params - model.text_field_embedder.output_dim = None
2021-01-15 16:27:44,392 - INFO - allennlp.common.params - model.text_field_embedder.sum_embeddings = None
2021-01-15 16:27:44,392 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.token_embedders.token_embedder.TokenEmbedder'> from params {'bert_config': 'config/archive/bert-base-multilingual-cased/bert_config.json', 'combine_layers': 'all', 'dropout': 0.1, 'layer_dropout': 0.08, 'requires_grad': True, 'type': 'udify-bert-predictor'} and extras {'vocab'}
2021-01-15 16:27:44,393 - INFO - allennlp.common.params - model.text_field_embedder.token_embedders.bert.type = udify-bert-predictor
2021-01-15 16:27:44,393 - INFO - allennlp.common.from_params - instantiating class <class 'udify.modules.bert_pretrained.UdifyPredictionBertEmbedder'> from params {'bert_config': 'config/archive/bert-base-multilingual-cased/bert_config.json', 'combine_layers': 'all', 'dropout': 0.1, 'layer_dropout': 0.08, 'requires_grad': True} and extras {'vocab'}
2021-01-15 16:27:44,393 - INFO - allennlp.common.params - model.text_field_embedder.token_embedders.bert.bert_config = config/archive/bert-base-multilingual-cased/bert_config.json
2021-01-15 16:27:44,393 - INFO - allennlp.common.params - model.text_field_embedder.token_embedders.bert.requires_grad = True
2021-01-15 16:27:44,393 - INFO - allennlp.common.params - model.text_field_embedder.token_embedders.bert.dropout = 0.1
2021-01-15 16:27:44,393 - INFO - allennlp.common.params - model.text_field_embedder.token_embedders.bert.layer_dropout = 0.08
2021-01-15 16:27:44,393 - INFO - allennlp.common.params - model.text_field_embedder.token_embedders.bert.combine_layers = all
2021-01-15 16:27:46,710 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.seq2seq_encoder.Seq2SeqEncoder'> from params {'input_dim': 768, 'type': 'pass_through'} and extras {'vocab'}
2021-01-15 16:27:46,710 - INFO - allennlp.common.params - model.encoder.type = pass_through
2021-01-15 16:27:46,711 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.pass_through_encoder.PassThroughEncoder'> from params {'input_dim': 768} and extras {'vocab'}
2021-01-15 16:27:46,711 - INFO - allennlp.common.params - model.encoder.input_dim = 768
2021-01-15 16:27:46,711 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.models.model.Model'> from params {'arc_representation_dim': 768, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'pos_embed_dim': None, 'tag_representation_dim': 256, 'type': 'udify_dependency_decoder'} and extras {'vocab'}
2021-01-15 16:27:46,711 - INFO - allennlp.common.params - model.decoders.deps.type = udify_dependency_decoder
2021-01-15 16:27:46,711 - INFO - allennlp.common.from_params - instantiating class <class 'udify.models.dependency_decoder.DependencyDecoder'> from params {'arc_representation_dim': 768, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'pos_embed_dim': None, 'tag_representation_dim': 256} and extras {'vocab'}
2021-01-15 16:27:46,711 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.seq2seq_encoder.Seq2SeqEncoder'> from params {'input_dim': 768, 'type': 'pass_through'} and extras {'vocab'}
2021-01-15 16:27:46,711 - INFO - allennlp.common.params - model.decoders.deps.encoder.type = pass_through
2021-01-15 16:27:46,711 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.pass_through_encoder.PassThroughEncoder'> from params {'input_dim': 768} and extras {'vocab'}
2021-01-15 16:27:46,711 - INFO - allennlp.common.params - model.decoders.deps.encoder.input_dim = 768
2021-01-15 16:27:46,712 - INFO - allennlp.common.params - model.decoders.deps.tag_representation_dim = 256
2021-01-15 16:27:46,712 - INFO - allennlp.common.params - model.decoders.deps.arc_representation_dim = 768
2021-01-15 16:27:46,712 - INFO - allennlp.common.params - model.decoders.deps.pos_embed_dim = None
2021-01-15 16:27:46,712 - INFO - allennlp.common.params - model.decoders.deps.use_mst_decoding_for_validation = True
2021-01-15 16:27:46,712 - INFO - allennlp.common.params - model.decoders.deps.dropout = 0.5
2021-01-15 16:27:46,712 - INFO - allennlp.common.registrable - instantiating registered subclass elu of <class 'allennlp.nn.activations.Activation'>
2021-01-15 16:27:46,718 - INFO - allennlp.common.registrable - instantiating registered subclass linear of <class 'allennlp.nn.activations.Activation'>
2021-01-15 16:27:46,722 - INFO - allennlp.common.registrable - instantiating registered subclass elu of <class 'allennlp.nn.activations.Activation'>
2021-01-15 16:27:46,867 - INFO - udify.models.dependency_decoder - Found POS tags corresponding to the following punctuation : {}. Ignoring words with these POS tags for evaluation.
2021-01-15 16:27:46,867 - INFO - allennlp.nn.initializers - Initializing parameters
2021-01-15 16:27:46,867 - INFO - allennlp.nn.initializers - Done initializing parameters; the following parameters are using their default initialization from their code
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - _head_sentinel
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - arc_attention._bias
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - arc_attention._weight_matrix
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - child_arc_feedforward._linear_layers.0.bias
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - child_arc_feedforward._linear_layers.0.weight
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - child_tag_feedforward._linear_layers.0.bias
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - child_tag_feedforward._linear_layers.0.weight
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - head_arc_feedforward._linear_layers.0.bias
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - head_arc_feedforward._linear_layers.0.weight
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - head_tag_feedforward._linear_layers.0.bias
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - head_tag_feedforward._linear_layers.0.weight
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - tag_bilinear.bias
2021-01-15 16:27:46,868 - INFO - allennlp.nn.initializers - tag_bilinear.weight
2021-01-15 16:27:46,869 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.models.model.Model'> from params {'adaptive': True, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'feats', 'type': 'udify_tag_decoder'} and extras {'vocab'}
2021-01-15 16:27:46,869 - INFO - allennlp.common.params - model.decoders.feats.type = udify_tag_decoder
2021-01-15 16:27:46,869 - INFO - allennlp.common.from_params - instantiating class <class 'udify.models.tag_decoder.TagDecoder'> from params {'adaptive': True, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'feats'} and extras {'vocab'}
2021-01-15 16:27:46,869 - INFO - allennlp.common.params - model.decoders.feats.task = feats
2021-01-15 16:27:46,869 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.seq2seq_encoder.Seq2SeqEncoder'> from params {'input_dim': 768, 'type': 'pass_through'} and extras {'vocab'}
2021-01-15 16:27:46,870 - INFO - allennlp.common.params - model.decoders.feats.encoder.type = pass_through
2021-01-15 16:27:46,870 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.pass_through_encoder.PassThroughEncoder'> from params {'input_dim': 768} and extras {'vocab'}
2021-01-15 16:27:46,870 - INFO - allennlp.common.params - model.decoders.feats.encoder.input_dim = 768
2021-01-15 16:27:46,870 - INFO - allennlp.common.params - model.decoders.feats.label_smoothing = 0.03
2021-01-15 16:27:46,870 - INFO - allennlp.common.params - model.decoders.feats.dropout = 0.5
2021-01-15 16:27:46,871 - INFO - allennlp.common.params - model.decoders.feats.adaptive = True
2021-01-15 16:27:46,871 - INFO - allennlp.common.params - model.decoders.feats.features = None
2021-01-15 16:27:46,895 - INFO - allennlp.nn.initializers - Initializing parameters
2021-01-15 16:27:46,895 - INFO - allennlp.nn.initializers - Done initializing parameters; the following parameters are using their default initialization from their code
2021-01-15 16:27:46,895 - INFO - allennlp.nn.initializers - task_output.head.weight
2021-01-15 16:27:46,895 - INFO - allennlp.nn.initializers - task_output.tail.0.0.weight
2021-01-15 16:27:46,895 - INFO - allennlp.nn.initializers - task_output.tail.0.1.weight
2021-01-15 16:27:46,895 - INFO - allennlp.nn.initializers - task_output.tail.1.0.weight
2021-01-15 16:27:46,896 - INFO - allennlp.nn.initializers - task_output.tail.1.1.weight
2021-01-15 16:27:46,896 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.models.model.Model'> from params {'adaptive': True, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'lemmas', 'type': 'udify_tag_decoder'} and extras {'vocab'}
2021-01-15 16:27:46,896 - INFO - allennlp.common.params - model.decoders.lemmas.type = udify_tag_decoder
2021-01-15 16:27:46,896 - INFO - allennlp.common.from_params - instantiating class <class 'udify.models.tag_decoder.TagDecoder'> from params {'adaptive': True, 'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'lemmas'} and extras {'vocab'}
2021-01-15 16:27:46,897 - INFO - allennlp.common.params - model.decoders.lemmas.task = lemmas
2021-01-15 16:27:46,898 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.seq2seq_encoder.Seq2SeqEncoder'> from params {'input_dim': 768, 'type': 'pass_through'} and extras {'vocab'}
2021-01-15 16:27:46,898 - INFO - allennlp.common.params - model.decoders.lemmas.encoder.type = pass_through
2021-01-15 16:27:46,898 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.pass_through_encoder.PassThroughEncoder'> from params {'input_dim': 768} and extras {'vocab'}
2021-01-15 16:27:46,899 - INFO - allennlp.common.params - model.decoders.lemmas.encoder.input_dim = 768
2021-01-15 16:27:46,899 - INFO - allennlp.common.params - model.decoders.lemmas.label_smoothing = 0.03
2021-01-15 16:27:46,900 - INFO - allennlp.common.params - model.decoders.lemmas.dropout = 0.5
2021-01-15 16:27:46,900 - INFO - allennlp.common.params - model.decoders.lemmas.adaptive = True
2021-01-15 16:27:46,900 - INFO - allennlp.common.params - model.decoders.lemmas.features = None
2021-01-15 16:27:47,014 - INFO - allennlp.nn.initializers - Initializing parameters
2021-01-15 16:27:47,014 - INFO - allennlp.nn.initializers - Done initializing parameters; the following parameters are using their default initialization from their code
2021-01-15 16:27:47,014 - INFO - allennlp.nn.initializers - task_output.head.weight
2021-01-15 16:27:47,015 - INFO - allennlp.nn.initializers - task_output.tail.0.0.weight
2021-01-15 16:27:47,015 - INFO - allennlp.nn.initializers - task_output.tail.0.1.weight
2021-01-15 16:27:47,015 - INFO - allennlp.nn.initializers - task_output.tail.1.0.weight
2021-01-15 16:27:47,015 - INFO - allennlp.nn.initializers - task_output.tail.1.1.weight
2021-01-15 16:27:47,015 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.models.model.Model'> from params {'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'upos', 'type': 'udify_tag_decoder'} and extras {'vocab'}
2021-01-15 16:27:47,015 - INFO - allennlp.common.params - model.decoders.upos.type = udify_tag_decoder
2021-01-15 16:27:47,015 - INFO - allennlp.common.from_params - instantiating class <class 'udify.models.tag_decoder.TagDecoder'> from params {'dropout': 0.5, 'encoder': {'input_dim': 768, 'type': 'pass_through'}, 'label_smoothing': 0.03, 'task': 'upos'} and extras {'vocab'}
2021-01-15 16:27:47,015 - INFO - allennlp.common.params - model.decoders.upos.task = upos
2021-01-15 16:27:47,015 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.seq2seq_encoder.Seq2SeqEncoder'> from params {'input_dim': 768, 'type': 'pass_through'} and extras {'vocab'}
2021-01-15 16:27:47,015 - INFO - allennlp.common.params - model.decoders.upos.encoder.type = pass_through
2021-01-15 16:27:47,015 - INFO - allennlp.common.from_params - instantiating class <class 'allennlp.modules.seq2seq_encoders.pass_through_encoder.PassThroughEncoder'> from params {'input_dim': 768} and extras {'vocab'}
2021-01-15 16:27:47,015 - INFO - allennlp.common.params - model.decoders.upos.encoder.input_dim = 768
2021-01-15 16:27:47,016 - INFO - allennlp.common.params - model.decoders.upos.label_smoothing = 0.03
2021-01-15 16:27:47,016 - INFO - allennlp.common.params - model.decoders.upos.dropout = 0.5
2021-01-15 16:27:47,016 - INFO - allennlp.common.params - model.decoders.upos.adaptive = False
2021-01-15 16:27:47,016 - INFO - allennlp.common.params - model.decoders.upos.features = None
2021-01-15 16:27:47,016 - INFO - allennlp.nn.initializers - Initializing parameters
2021-01-15 16:27:47,016 - INFO - allennlp.nn.initializers - Done initializing parameters; the following parameters are using their default initialization from their code
2021-01-15 16:27:47,016 - INFO - allennlp.nn.initializers - task_output._module.bias
2021-01-15 16:27:47,016 - INFO - allennlp.nn.initializers - task_output._module.weight
2021-01-15 16:27:47,017 - INFO - allennlp.common.params - model.dropout = 0.5
2021-01-15 16:27:47,017 - INFO - allennlp.common.params - model.word_dropout = 0.1
2021-01-15 16:27:47,017 - INFO - allennlp.common.params - model.mix_embedding = 12
2021-01-15 16:27:47,017 - INFO - allennlp.common.params - model.layer_dropout = 0.08
2021-01-15 16:27:47,017 - INFO - pytorch_pretrained_bert.tokenization - loading vocabulary file config/archive/bert-base-multilingual-cased/vocab.txt
2021-01-15 16:27:47,258 - INFO - allennlp.nn.initializers - Initializing parameters
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - Done initializing parameters; the following parameters are using their default initialization from their code
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps._head_sentinel
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.arc_attention._bias
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.arc_attention._weight_matrix
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.child_arc_feedforward._linear_layers.0.bias
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.child_arc_feedforward._linear_layers.0.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.child_tag_feedforward._linear_layers.0.bias
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.child_tag_feedforward._linear_layers.0.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.head_arc_feedforward._linear_layers.0.bias
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.head_arc_feedforward._linear_layers.0.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.head_tag_feedforward._linear_layers.0.bias
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.head_tag_feedforward._linear_layers.0.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.tag_bilinear.bias
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.deps.tag_bilinear.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.feats.task_output.head.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.feats.task_output.tail.0.0.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.feats.task_output.tail.0.1.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.feats.task_output.tail.1.0.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.feats.task_output.tail.1.1.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.lemmas.task_output.head.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.lemmas.task_output.tail.0.0.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.lemmas.task_output.tail.0.1.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.lemmas.task_output.tail.1.0.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.lemmas.task_output.tail.1.1.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.upos.task_output._module.bias
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - decoders.upos.task_output._module.weight
2021-01-15 16:27:47,259 - INFO - allennlp.nn.initializers - scalar_mix.deps.gamma
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.0
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.1
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.10
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.11
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.2
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.3
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.4
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.5
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.6
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.7
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.8
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.deps.scalar_parameters.9
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.gamma
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.0
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.1
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.10
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.11
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.2
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.3
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.4
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.5
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.6
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.7
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.8
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.feats.scalar_parameters.9
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.gamma
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.0
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.1
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.10
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.11
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.2
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.3
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.4
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.5
2021-01-15 16:27:47,260 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.6
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.7
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.8
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.lemmas.scalar_parameters.9
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.gamma
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.0
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.1
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.10
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.11
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.2
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.3
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.4
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.5
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.6
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.7
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.8
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - scalar_mix.upos.scalar_parameters.9
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.embeddings.LayerNorm.bias
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.embeddings.LayerNorm.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.embeddings.position_embeddings.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.embeddings.token_type_embeddings.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.embeddings.word_embeddings.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.output.LayerNorm.bias
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.output.LayerNorm.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.output.dense.bias
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.output.dense.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.self.key.bias
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.self.key.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.self.query.bias
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.self.query.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.self.value.bias
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.attention.self.value.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.intermediate.dense.bias
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.intermediate.dense.weight
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.output.LayerNorm.bias
2021-01-15 16:27:47,261 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.output.LayerNorm.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.output.dense.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.0.output.dense.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.output.LayerNorm.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.output.LayerNorm.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.output.dense.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.output.dense.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.self.key.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.self.key.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.self.query.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.self.query.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.self.value.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.attention.self.value.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.intermediate.dense.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.intermediate.dense.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.output.LayerNorm.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.output.LayerNorm.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.output.dense.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.1.output.dense.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.output.LayerNorm.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.output.LayerNorm.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.output.dense.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.output.dense.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.self.key.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.self.key.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.self.query.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.self.query.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.self.value.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.attention.self.value.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.intermediate.dense.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.intermediate.dense.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.output.LayerNorm.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.output.LayerNorm.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.output.dense.bias
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.10.output.dense.weight
2021-01-15 16:27:47,262 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.output.LayerNorm.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.output.LayerNorm.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.output.dense.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.output.dense.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.self.key.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.self.key.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.self.query.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.self.query.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.self.value.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.attention.self.value.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.intermediate.dense.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.intermediate.dense.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.output.LayerNorm.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.output.LayerNorm.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.output.dense.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.11.output.dense.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.output.LayerNorm.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.output.LayerNorm.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.output.dense.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.output.dense.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.self.key.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.self.key.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.self.query.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.self.query.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.self.value.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.attention.self.value.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.intermediate.dense.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.intermediate.dense.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.output.LayerNorm.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.output.LayerNorm.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.output.dense.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.2.output.dense.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.output.LayerNorm.bias
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.output.LayerNorm.weight
2021-01-15 16:27:47,263 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.output.dense.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.output.dense.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.self.key.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.self.key.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.self.query.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.self.query.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.self.value.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.attention.self.value.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.intermediate.dense.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.intermediate.dense.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.output.LayerNorm.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.output.LayerNorm.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.output.dense.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.3.output.dense.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.output.LayerNorm.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.output.LayerNorm.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.output.dense.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.output.dense.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.self.key.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.self.key.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.self.query.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.self.query.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.self.value.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.attention.self.value.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.intermediate.dense.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.intermediate.dense.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.output.LayerNorm.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.output.LayerNorm.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.output.dense.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.4.output.dense.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.output.LayerNorm.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.output.LayerNorm.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.output.dense.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.output.dense.weight
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.self.key.bias
2021-01-15 16:27:47,264 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.self.key.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.self.query.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.self.query.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.self.value.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.attention.self.value.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.intermediate.dense.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.intermediate.dense.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.output.LayerNorm.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.output.LayerNorm.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.output.dense.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.5.output.dense.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.output.LayerNorm.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.output.LayerNorm.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.output.dense.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.output.dense.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.self.key.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.self.key.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.self.query.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.self.query.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.self.value.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.attention.self.value.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.intermediate.dense.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.intermediate.dense.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.output.LayerNorm.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.output.LayerNorm.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.output.dense.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.6.output.dense.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.output.LayerNorm.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.output.LayerNorm.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.output.dense.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.output.dense.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.self.key.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.self.key.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.self.query.bias
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.self.query.weight
2021-01-15 16:27:47,265 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.self.value.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.attention.self.value.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.intermediate.dense.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.intermediate.dense.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.output.LayerNorm.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.output.LayerNorm.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.output.dense.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.7.output.dense.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.output.LayerNorm.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.output.LayerNorm.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.output.dense.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.output.dense.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.self.key.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.self.key.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.self.query.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.self.query.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.self.value.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.attention.self.value.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.intermediate.dense.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.intermediate.dense.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.output.LayerNorm.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.output.LayerNorm.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.output.dense.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.8.output.dense.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.output.LayerNorm.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.output.LayerNorm.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.output.dense.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.output.dense.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.self.key.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.self.key.weight
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.self.query.bias
2021-01-15 16:27:47,266 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.self.query.weight
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.self.value.bias
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.attention.self.value.weight
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.intermediate.dense.bias
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.intermediate.dense.weight
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.output.LayerNorm.bias
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.output.LayerNorm.weight
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.output.dense.bias
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.encoder.layer.9.output.dense.weight
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.pooler.dense.bias
2021-01-15 16:27:47,267 - INFO - allennlp.nn.initializers - text_field_embedder.token_embedder_bert.bert_model.pooler.dense.weight
2021-01-15 16:27:47,268 - INFO - udify.models.udify_model - Total number of parameters: 212246786
2021-01-15 16:27:47,268 - INFO - udify.models.udify_model - Total number of trainable parameters: 212246786
Traceback (most recent call last):
File "predict.py", line 59, in <module>
util.predict_and_evaluate_model_with_archive(predictor, params, archive_dir, args.input_file,
File "/home/fran/source/udify/udify/util.py", line 163, in predict_and_evaluate_model_with_archive
predict_model_with_archive(predictor, params, archive, segment_file, pred_file, batch_size)
File "/home/fran/source/udify/udify/util.py", line 142, in predict_model_with_archive
archive = load_archive(archive,
File "/home/fran/.local/lib/python3.8/site-packages/allennlp/models/archival.py", line 227, in load_archive
model = Model.load(config.duplicate(),
File "/home/fran/.local/lib/python3.8/site-packages/allennlp/models/model.py", line 327, in load
return cls.by_name(model_type)._load(config, serialization_dir, weights_file, cuda_device)
File "/home/fran/.local/lib/python3.8/site-packages/allennlp/models/model.py", line 275, in _load
model_state = torch.load(weights_file, map_location=util.device_mapping(cuda_device))
File "/home/fran/.local/lib/python3.8/site-packages/torch/serialization.py", line 529, in load
return _legacy_load(opened_file, map_location, pickle_module, **pickle_load_args)
File "/home/fran/.local/lib/python3.8/site-packages/torch/serialization.py", line 709, in _legacy_load
deserialized_objects[key]._set_from_file(f, offset, f_should_read_directly)
RuntimeError: unexpected EOF, expected 316407350 more bytes. The file might be corrupted.
corrupted double-linked list
Avortat
fran@ipek:~/source/udify$