Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

No module named Tensorframes on EMR installation #113

Open
nikparmar opened this issue Jun 24, 2017 · 6 comments
Open

No module named Tensorframes on EMR installation #113

nikparmar opened this issue Jun 24, 2017 · 6 comments

Comments

@nikparmar
Copy link

Hey I performed the steps to install tensorframes but at the end when Spark shell launches and I run import tensorframes as tf I get no module tensorframes error

@taklwu
Copy link

taklwu commented Jul 17, 2017

I had the same issue when just use pyspark with automatic package installation.

pyspark --packages databricks:tensorframes:0.2.8-s_2.11

So, at the end I decided to build the jars from source and use the jar as extra library, also, you might need to install pandas before running pyspark

PYTHONPATH=tensorframes-assembly-0.2.8.jar \
pyspark --jars tensorframes-assembly-0.2.8.jar

@thunterdb
Copy link
Contributor

@nikparmar can you tell me more about your current spark setup?

@nikparmar
Copy link
Author

@thunterdb So it's an Amazon EMR setup where I was installing this.

@taklwu I will try and update you

@thunterdb
Copy link
Contributor

@nikparmar I am not very familiar with EMR's setup. Did the suggestion above work for you?

@ghost
Copy link

ghost commented Sep 8, 2017

@nikparmar Hello Nik, can you tell me the steps to install tensorframes, i.e. if you have started using it. Thanks,
Kartik

@tanjialiang
Copy link

tanjialiang commented Nov 17, 2017

Hi @thunterdb ,

I have similar issues. I have Spark running in yarn-cluster mode (nodes running CentOS7), version 2.0.1. I added databricks:tensorframes:0.2.9-s_2.11 to spark.jars.packages and it seemed to load the jar according to Spark UI. But I still get no module named tensorframes error when I try to import tensorframes. I have python version 2.7.5 installed across all nodes. I cannot go the other method that sets PYTHONPATH to every node because as I am running Spark in cluster mode, and in this case I will have to maintain it in every single node along with the assemly jars. Is this a bug? If not where do you see the issues are?

Thanks,
Jialiang

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants