diff --git a/README.md b/README.md index d547fb5bb..0c3bf6711 100644 --- a/README.md +++ b/README.md @@ -160,6 +160,8 @@ repository by accessing them through their GitHub link, like: _ = mlflow.run( f"{config['main']['components_repository']}/get_data", "main", + version='main', + env_manager="conda", parameters={ "sample": config["etl"]["sample"], "artifact_name": "sample.csv", diff --git a/components/README.md b/components/README.md index f36133205..742813f87 100644 --- a/components/README.md +++ b/components/README.md @@ -3,7 +3,7 @@ In order to run these components you need to have conda (Miniconda or Anaconda) and MLflow installed. Install it with:: - > conda install mlflow=1.14.1 + > conda install mlflow=2.8.1 then run:: diff --git a/components/get_data/conda.yml b/components/get_data/conda.yml index b03c2b43a..c9711150c 100644 --- a/components/get_data/conda.yml +++ b/components/get_data/conda.yml @@ -5,6 +5,7 @@ channels: dependencies: - pip=23.3.1 - requests=2.24.0 + - pyarrow - pip: - mlflow==2.8.1 - wandb==0.16.0 diff --git a/main.py b/main.py index 27a89b1d6..ade94844b 100644 --- a/main.py +++ b/main.py @@ -41,6 +41,7 @@ def go(config: DictConfig): f"{config['main']['components_repository']}/get_data", "main", version='main', + env_manager="conda", parameters={ "sample": config["etl"]["sample"], "artifact_name": "sample.csv",