Skip to content

Commit

Permalink
flyte nlp workflow
Browse files Browse the repository at this point in the history
  • Loading branch information
BerndDoser committed Oct 13, 2024
1 parent d7277b8 commit 0083d32
Show file tree
Hide file tree
Showing 3 changed files with 22 additions and 4 deletions.
2 changes: 1 addition & 1 deletion _quarto.yml
Original file line number Diff line number Diff line change
Expand Up @@ -20,5 +20,5 @@ format:
footer: "ML Workflow Orchestration (Bernd Doser, HITS)"
slide-number: true
highlight-style: a11y
code-block-height: 650px
# code-block-height: 650px
# width: 1300
4 changes: 2 additions & 2 deletions code/streamflow_slurm.yaml
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
workflows:
(same as left)
# same as left

deployments:
ssh-cascade:
Expand All @@ -16,4 +16,4 @@ deployments:
partition: cascade.p
nodes: 1
mem: 1gb
wraps: ssh-cascade
wraps: ssh-cascade
20 changes: 19 additions & 1 deletion flyte.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,25 @@ def mean(values: List[float]) -> float:

## Flyte Workflows

todo ...
- Workflows doesn’t perform computations it’s used to structure tasks

```python
workflow_outputs = typing.NamedTuple(
"WorkflowOutputs",
simwords=Dict[str, float],
distance=float,
topics=Dict[int, List[str]],
)

@workflow
def nlp_workflow(target_word: str = "computer") -> workflow_outputs:
corpus = generate_processed_corpus()
model_wv = train_word2vec_model(training_data=corpus, hyperparams=Word2VecModelHyperparams())
lda_topics = train_lda_model(corpus=corpus, hyperparams=LDAModelHyperparams())
similar_words = word_similarities(model_ser=model_wv.model, word=target_word)
distance = word_movers_distance(model_ser=model_wv.model)
return similar_words, distance, lda_topics
```


## Containerized Tasks
Expand Down

0 comments on commit 0083d32

Please sign in to comment.