πFlow Deployment Guide V1.4
1. Spark2, Hadoop, Yarn, and Hive (optional) are already deployed
2. JDK 1.8
3. Scala 2.11.8
4. The following ports must be available:
- 8002: πFlow Server
- 50002: h2db port of πFlow Server
- 6002: πFlow Web service
- 6001: πFlow web page access address
- 6443: the default HTTPS listening port of Tomcat
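Before installing, it may help to confirm that none of these ports is already taken. The following is a minimal sketch and not part of πFlow: the function name `find_port_conflicts` is hypothetical, and it assumes the `ss` tool from iproute2 is installed and prints its usual `ss -ltn` layout (state in column 1, local address and port in column 4).

```bash
# Ports listed in this guide for πFlow Server and πFlow Web.
REQUIRED_PORTS="8002 50002 6002 6001 6443"

# Read `ss -ltn` style output on stdin and print every required port
# that already has a listener (hypothetical helper, not shipped with πFlow).
find_port_conflicts() {
  awk -v ports="$REQUIRED_PORTS" '
    BEGIN { n = split(ports, p, " "); for (i = 1; i <= n; i++) want[p[i]] = 1 }
    $1 == "LISTEN" {
      k = split($4, a, ":")          # the port is the last ":"-separated field
      if (a[k] in want) busy[a[k]] = 1
    }
    END { for (i = 1; i <= n; i++) if (p[i] in busy) print p[i] }
  '
}

# Usage:
#   ss -ltn | find_port_conflicts
```

Each busy port is printed on its own line; empty output suggests all five ports are free.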
https://github.com/cas-bigdatalab/piflow/releases/tag/v1.4
https://github.com/cas-bigdatalab/piflow/releases/download/v1.4/piflow-server-v1.4.tar.gz
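Extract piflow-server-v1.4.tar.gz. The extracted directory contains:
- bin: πFlow command-line tools
- classpath: directory for user-developed custom Stop components
- config.properties: the configuration file
- lib: jars required by πFlow Server; piflow-server-0.9.jar is the πFlow Server jar itself
- logs: πFlow log directory
- start.sh, restart.sh, stop.sh, status.sh: scripts to start, restart, stop, and check the status of πFlow Server
- scala: directory for programmable scripts
- sparkJar: directory for Spark dependency jars
- flowFile: directory where flow and flow group JSON files are stored
- example: sample flow and flow group configurations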
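A quick way to confirm the archive was extracted completely is to look for each entry in the listing above. This is only a sketch: the function name `missing_entries` is hypothetical, and it assumes all listed entries sit directly under the extraction directory. piflow-server-0.9.jar is left out of the check because its exact location is not stated here.

```bash
# Entries named in the listing above.
EXPECTED_ENTRIES="bin classpath config.properties lib logs start.sh restart.sh stop.sh status.sh scala sparkJar flowFile example"

# Print every expected entry that does not exist under the given directory
# (hypothetical helper; the subshell body keeps its variables local).
missing_entries() (
  dir="$1"
  for entry in $EXPECTED_ENTRIES; do
    [ -e "$dir/$entry" ] || echo "$entry"
  done
)

# Usage (the path is the PIFLOW_HOME value used later in this guide):
#   missing_entries /data/piflowServer
```

Empty output means every listed entry was found.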
#Spark master and deploy mode
spark.master=yarn
spark.deploy.mode=cluster
#hdfs default file system
fs.defaultFS=hdfs://10.0.85.83:9000
#yarn resourcemanager hostname
yarn.resourcemanager.hostname=10.0.85.83
#if you want to use hive, set hive metastore uris
#hive.metastore.uris=thrift://10.0.85.83:9083
#show data in log; set to 0 if you do not want to show data in the logs
data.show=10
#monitor the throughput of flow
monitor.throughput=true
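#server port (8002 is the πFlow Server port listed in the port requirements)
server.port=8002
#h2db port (50002 is the h2db port listed in the port requirements)
h2.port=50002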
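After editing config.properties, the values can be read back from the shell to catch a key that was left commented out or misspelled. The sketch below assumes plain `key=value` lines as in the sample above. The helper names `get_prop` and `missing_keys` are hypothetical, and the key list simply mirrors the uncommented keys of the sample; whether πFlow Server strictly requires each of them is not stated here.

```bash
# Print the value of KEY ($1) from a .properties file ($2); fail if the key
# is absent or appears only in a commented-out line.
get_prop() {
  awk -v k="$1" '
    /^[ \t]*#/ { next }                      # skip comment lines
    {
      i = index($0, "=")                     # split at the first "="
      if (i == 0) next
      name = substr($0, 1, i - 1)
      val  = substr($0, i + 1)
      gsub(/^[ \t]+|[ \t]+$/, "", name)      # trim surrounding whitespace
      gsub(/^[ \t]+|[ \t]+$/, "", val)
      if (name == k) { found = 1; last = val }
    }
    END {
      if (!found) exit 1
      print last
    }
  ' "$2"
}

# Keys that are set (not commented out) in the sample above.
SERVER_KEYS="spark.master spark.deploy.mode fs.defaultFS yarn.resourcemanager.hostname data.show monitor.throughput server.port h2.port"

# Print every key from SERVER_KEYS that the file ($1) does not set.
missing_keys() (
  for key in $SERVER_KEYS; do
    get_prop "$key" "$1" >/dev/null || echo "$key"
  done
)

# Usage:
#   get_prop server.port config.properties
#   missing_keys config.properties
```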
Configure the cluster environment variables (adjust the paths to your own installation as needed):
export JAVA_HOME=/opt/java
export JRE_HOME=/opt/java/jre
export CLASSPATH=.:$JAVA_HOME/lib:$JRE_HOME/lib:$CLASSPATH
export PATH=$JAVA_HOME/bin:$JRE_HOME/bin:$PATH
export HADOOP_HOME=/opt/hadoop-2.6.0
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
export HIVE_HOME=/opt/apache-hive-2.3.6-bin
export PATH=$PATH:$HIVE_HOME/bin
export SPARK_HOME=/opt/spark-2.1.0-bin-hadoop2.6
export PATH=$PATH:$SPARK_HOME/bin
export SCALA_HOME=/opt/scala-2.11.8
export PATH=$PATH:$SCALA_HOME/bin
export PIFLOW_HOME=/data/piflowServer
export PATH=$PATH:${PIFLOW_HOME}/bin
export DISPLAY=
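Once the variables are exported, a short check can reveal a typo in a path before the server is started. This is a sketch only: the function name `bad_vars` is hypothetical, and HIVE_HOME is deliberately left out of the list because Hive is optional in this guide.

```bash
# Variables from the export list above that point at installation directories.
REQUIRED_VARS="JAVA_HOME HADOOP_HOME SPARK_HOME SCALA_HOME PIFLOW_HOME"

# Print every required variable that is unset, empty, or not an existing directory.
bad_vars() (
  for name in $REQUIRED_VARS; do
    # Portable indirect lookup; names come only from the fixed list above.
    eval "value=\${$name:-}"
    [ -n "$value" ] && [ -d "$value" ] || echo "$name"
  done
)

# Usage: run in the same shell session (or after sourcing the profile
# that holds the export lines):
#   bad_vars
```

Empty output means every variable is set and points to an existing directory.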
Configure the spark-env.sh file; adjust the path to match your actual environment:
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib64/python3.6/site-packages/jep/
./start.sh
- MySQL 5.7
- JDK 1.8
Download the installation package:
https://github.com/cas-bigdatalab/piflow-web/releases/download/v1.4/piflow-web-v1.4.tar.gz
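Extract piflow-web-v1.4.tar.gz.
Contents after extraction:
(1) piflow-tomcat is the container that runs piflow-web.
(2) config.properties is the configuration file.
(3) start.sh is the start script.
(4) stop.sh is the stop script.
(5) status.sh is the status-check script.
(6) restart.sh is the restart script.
(7) logs is the log directory.
(8) storage is the directory where the web application stores files.
(9) temp_v0.7.sh is a patch script for smooth upgrades (it does not need to be run).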
The config.properties file is shown below.
To use a MySQL database, use this configuration:
server.servlet.session.timeout=3600
syspara.interfaceUrlHead=http://127.0.0.1:8002
syspara.livyServer=http://127.0.0.1:8998
server.hdfs.url=None
syspara.isIframe=true
# Total maximum value of uploaded files
spring.servlet.multipart.max-request-size=512MB
# Maximum value of a single file
spring.servlet.multipart.max-file-size=512MB
# data source
sysParam.datasource.type=mysql
# MySQL Configuration
#Configure the connection address of MySQL
spring.datasource.url = jdbc:mysql://127.0.0.1:3306/piflow_web?useUnicode=true&characterEncoding=UTF-8&useSSL=false&allowMultiQueries=true&autoReconnect=true&failOverReadOnly=false
#Configure database user name
spring.datasource.username=nature
#Configure database password
spring.datasource.password=123456
#Configure JDBC Driver
# Optional: the driver can be identified automatically from the URL, but setting it explicitly is recommended
spring.datasource.driver-class-name=com.mysql.jdbc.Driver
spring.flyway.locations=classpath:db/flyway-mysql/
# Log levels
logging.level.cn.cnic.*.mapper.*=warn
logging.level.root=warn
logging.level.org.flywaydb=warn
logging.level.org.springframework.security=warn
logging.level.org.hibernate.SQL=warn
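Because piflow-web connects to the database named in spring.datasource.url, it can be useful to pull the host and port out of that URL and confirm the MySQL server is reachable before starting. The helper below is a hypothetical sketch that handles only the simple `jdbc:mysql://host:port/db?...` form used in the sample.

```bash
# Print HOST:PORT from a jdbc:mysql:// URL, i.e. everything between "//"
# and the next "/" or "?". Prints nothing for other URL forms.
jdbc_host_port() {
  printf '%s\n' "$1" | sed -n 's#^jdbc:mysql://\([^/?]*\).*$#\1#p'
}

# Usage:
#   jdbc_host_port 'jdbc:mysql://127.0.0.1:3306/piflow_web?useUnicode=true'
```

With the host and port known, a manual login with the MySQL client, for example `mysql -h 127.0.0.1 -P 3306 -u nature -p`, is one way to verify the credentials from the configuration.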
To use an H2DB database, use this configuration:
server.servlet.session.timeout=3600
syspara.interfaceUrlHead=http://127.0.0.1:8002
syspara.livyServer=http://127.0.0.1:8998
server.hdfs.url=None
syspara.isIframe=true
# Total maximum value of uploaded files
spring.servlet.multipart.max-request-size=512MB
# Maximum value of a single file
spring.servlet.multipart.max-file-size=512MB
# data source
sysParam.datasource.type=h2
# h2 Configuration
#Configure the connection address of H2DB
spring.datasource.url=jdbc:h2:file:/media/nature/linux_disk_0/PiFlow_DB/piflow_web
#Configure database user name
spring.datasource.username=Admin
#Configure database password
spring.datasource.password=Admin
#Configure JDBC Driver
# Optional: the driver can be identified automatically from the URL, but setting it explicitly is recommended
spring.datasource.driver-class-name=org.h2.Driver
##H2DB web console settings
spring.datasource.platform=h2
#After this configuration, the h2 web console can be accessed remotely. Otherwise it can only be accessed locally.
spring.h2.console.settings.web-allow-others=true
#With this configuration, you can access the h2 web console at YOUR_URL/h2, where YOUR_URL is the access URL of your program.
spring.h2.console.path=/h2
#With this configuration, the h2 web console starts when the program starts. This is the default; set it to false if you do not want the h2 web console to start with the program.
spring.h2.console.enabled=true
spring.flyway.locations=classpath:db/flyway-h2db/
# Log levels
logging.level.cn.cnic.*.mapper.*=warn
logging.level.root=warn
logging.level.org.flywaydb=warn
logging.level.org.springframework.security=warn
logging.level.org.hibernate.SQL=warn
Run:
cd piflow-web
./start.sh
Then open http://serverIp:6001 in a browser to register and log in.
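The web application may take some time to start, so the page might not respond immediately after start.sh returns. The following sketch polls until port 6001 accepts TCP connections. It assumes bash (it relies on bash's `/dev/tcp` redirection), the function name `wait_for_port` is hypothetical, and the 2-second interval and default of 30 attempts are arbitrary choices.

```bash
# Wait until HOST ($1) accepts TCP connections on PORT ($2).
# Tries up to TRIES ($3, default 30) times; returns 0 on success, 1 otherwise.
wait_for_port() (
  host="$1"; port="$2"; tries="${3:-30}"
  i=1
  while [ "$i" -le "$tries" ]; do
    # Bash-only: opening /dev/tcp/HOST/PORT succeeds once a listener accepts.
    if (exec 3<>"/dev/tcp/$host/$port") 2>/dev/null; then
      return 0
    fi
    [ "$i" -lt "$tries" ] && sleep 2
    i=$((i + 1))
  done
  return 1
)

# Usage:
#   wait_for_port 127.0.0.1 6001 30 && echo "piflow-web is accepting connections"
```

An open port only shows that Tomcat is listening; the login page itself still needs to be checked in the browser.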