1. Download Kettle, version 7.1, the official website is too slow, do not download from the official website, refer to the address: https://blog.csdn.net/weixin_43407520/article/details/ 123593474
2. Kettle is a java program, you need to install jdk1.8 first (paired with kettle7.1), reference address: https://blog.csdn.net/weixin_45078706/article/details/115830318
Jdk download address, download jdk-8u341-windows-i586.exe: https://www.oracle.com/java/technologies/downloads/#java8-windows
Install jdk, reference address: https://product. pconline.com.cn/itbk/software/dnwt/1408/5260596.html
3. Start the kettle, find the spoon.bat and open it, and you can log in to the interface successfully
4. Download the jar driver packages of each database, and copy them into the xxx\pdi-ce-7.1.0.0-12\data-integration\lib directory, restart kettle, sqlserver driver package address:
sqljdbc4-2.0.jar: http://www.java2s.com/Code/Jar/s/Downloadsqljdbc430jar.htm
jtds-1.3.1.jar: https://sourceforge.net/projects/jtds/
5. Connect to the sqlserver database, log in to the kettle account and password is admin/admin, convert-right click-new-main object tree-DB connection-new-configure connection-right click connection name-share, reference address:
https://blog.csdn.net/MengCXJ/article/details/123085141
6. Create transform, core object-input-drag table input-edit table input, output-drag insert update-hold shift to connect table input and insert update-edit insert update, run-observe Log – observe the import result
7. Create a job job, file-new-job, core object-drag-in-common start, conversion, dummy, script SQL and other components-press and hold shift to connect each component, open the conversion-select the corresponding created Conversion, open SQL-write corresponding sql script, execute-observe log-observe database result
8. dummy is a placeholder, do nothing, for example, what is the execution result of sql, we can know the execution result of sql after connecting to dummy
9. Open the scheduled task, open start-check repeat-configure repeat time period-execute, observe the execution log-observe the execution result
10. Kettle timing scheduling, reference address:
https://blog.csdn.net/hzp666/article/details/112465876
You can consider using kettleonline, but it costs about 800 yuan, refer to Address:
http://www.kettle.org.cn/