-
Notifications
You must be signed in to change notification settings - Fork 14
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Finalized configs for CICD; fixed logging; updated documentation; emb…
…edding:added pooler output as option
- Loading branch information
Geigle
committed
Aug 10, 2021
1 parent
4a577c3
commit d3c047e
Showing
12 changed files
with
151 additions
and
24 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
23 changes: 23 additions & 0 deletions
23
square-model-inference-api/inference_server/.env.bert_adapter
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
# Corresponds to the Huggingface name for finetuned Transformers or the name of a finetuned SentenceTransformers | ||
MODEL_NAME=bert-base-uncased | ||
# Type of the model, e.g. Transformers, Adapter, ... | ||
# See square_model_inference.core.event_handlers.MODEL_MAPPING for all available names with corresponding model | ||
MODEL_TYPE=adapter | ||
|
||
# Disable CUDA even if available | ||
DISABLE_GPU=True | ||
# Batch size used for many inputs | ||
BATCH_SIZE=32 | ||
# Inputs larger than this size are rejected | ||
MAX_INPUT_SIZE=1024 | ||
|
||
# Cache directory where model weights are stored | ||
# This is the name for the env variable used by transformers and sentence-transformers package | ||
TRANSFORMERS_CACHE=/etc/huggingface/.cache/ | ||
|
||
|
||
# Flag that decides if returned numpy arrays are returned | ||
# as lists or encoded to base64 (smaller but not easily human readable). | ||
# See the comment in square_model_inference.models.prediction._encode_numpy on information on how to decode | ||
# the base64 string back to the numpy array | ||
RETURN_PLAINTEXT_ARRAYS=False |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
# Corresponds to the Huggingface name for finetuned Transformers or the name of a finetuned SentenceTransformers | ||
MODEL_NAME=facebook/dpr-question_encoder-single-nq-base | ||
# Type of the model, e.g. Transformers, Adapter, ... | ||
# See square_model_inference.core.event_handlers.MODEL_MAPPING for all available names with corresponding model | ||
MODEL_TYPE=transformer | ||
|
||
# Disable CUDA even if available | ||
DISABLE_GPU=False | ||
# Batch size used for many inputs | ||
BATCH_SIZE=32 | ||
# Inputs larger than this size are rejected | ||
MAX_INPUT_SIZE=1024 | ||
|
||
# Cache directory where model weights are stored | ||
# This is the name for the env variable used by transformers and sentence-transformers package | ||
TRANSFORMERS_CACHE=/etc/huggingface/.cache/ | ||
|
||
# For MODEL_TYPE=transformers: decides the AutoModel* class used | ||
# See square_model_inference.inference.transformer.CLASS_MAPPING for valid names and corresponding class | ||
MODEL_CLASS=base | ||
|
||
# Flag that decides if returned numpy arrays are returned | ||
# as lists or encoded to base64 (smaller but not easily human readable). | ||
# See the comment in square_model_inference.models.prediction._encode_numpy on information on how to decode | ||
# the base64 string back to the numpy array | ||
RETURN_PLAINTEXT_ARRAYS=False |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters