Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a complete introductory example #241

Open
halvard-cognite opened this issue May 24, 2019 · 4 comments
Open

Create a complete introductory example #241

halvard-cognite opened this issue May 24, 2019 · 4 comments
Assignees

Comments

@halvard-cognite
Copy link

The current documentation takes a lot of knowledge for granted.

I would love it if this repo had a complete example with instructions that I could compile and run somewhere (dataproc for example).

Might build on/copy this:
https://cloud.google.com/dataproc/docs/tutorials/spark-scala

@hakontro
Copy link
Contributor

Thanks for the feedback! You're right, the tutorials are narrowed down to using the data source from an already set up Spark cluster with the library available etc.

Just so I understand correctly - you're requesting a more thorough step-by-step guide for building and deploying Spark with the datasource available, or does this apply to the read/write examples as well?

@halvard-cognite
Copy link
Author

The read/write examples are probably fine.

I'm guessing I'm not going to be the last person with no experience using Spark and Scala that will show up at this repo and want to test something out with data from CDF.

@wjoel
Copy link
Contributor

wjoel commented May 27, 2019

As in a tutorial more about setting things up (installing the data source in Dataproc or other clusters) rather than the usage of it as explained in https://github.com/cognitedata/notebook-examples/blob/master/spark/tutorials/Cognite%20Spark%20data%20source%20tutorial.ipynb ?

@halvard-cognite
Copy link
Author

To be fair I did not find the tutorials until after posting this issue and talking to Emil.
But yes, for me getting a minimal code sample running was the big challenge.
Dependency mgmt in the ecosystem etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants