环境&源码筹备

git 源码

登陆https://github.com/apache/hudi

git clone https://github.com/apache/hudi.git

代码编译

可用的maven settings设置(还是会短少一些包)

 <mirrors>     <!-- mirror | Specifies a repository mirror site to use instead of a given       repository. The repository that | this mirror serves has an ID that matches       the mirrorOf element of this mirror. IDs are used | for inheritance and direct       lookup purposes, and must be unique across the set of mirrors. | -->     <mirror>    <id>aliyunmaven</id>    <mirrorOf>*</mirrorOf>    <name>spring-plugin</name>    <url>https://maven.aliyun.com/repository/spring-plugin</url> </mirror>  <mirror>    <id>central</id>    <name>Maven Repository Switchboard</name>    <url>https://repo1.maven.org/maven2/</url>    <mirrorOf>central</mirrorOf></mirror>  </mirrors> 

筹备完结当前能够开始编译

mvn clean package -DskipTests

上面说一说编译遇到的问题
1.首先短少包:
pentaho-aggdesigner-algorithm-5.1.5-jhyde.jar
这个手动下载的,下载地址:
https://public.nexus.pentaho....

2.短少io.confluent包
参考:https://blog.csdn.net/weixin_...
将包补齐即可打包胜利

启动命令

/usr/local/spark/bin/spark-shell \  --jars `ls /Users/##/SourceCode/hudi/hudi/packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-*.*.*-SNAPSHOT.jar` \