Setup Pentaho ETL server Linux

Posted . Visible to the public.

Todo

  • rotate logs
  • startup script for carte server

Software

  1. Setup centos 6.5
  2. set hostname in /etc/hosts
  3. start sshd
  4. turn off iptables
  5. add EPEL repo
  wget http://dl.fedoraproject.org/pub/epel/6/x86_64/epel-release-6-8.noarch.rpm
  wget http://rpms.famillecollet.com/enterprise/remi-release-6.rpm
  sudo rpm -Uvh remi-release-6*.rpm epel-release-6*.rpm
  1. yum install htop acpi iotop iftop ftop wget man mlocate
  2. create users pentaho and postgres
  3. Java setup package via http://wiki.centos.org/HowTos/JavaOnCentOS
  • download (urgh) from (puke) Oracle
  • yum localinstall jdk-7u51-linux-x64.rpm
  1. home variables
$ vi /etc/profile.d/java.csh
export JAVA_HOME=/usr/java/latest
export PATH=$PATH:/usr/java/latest/bin

PostgrSQL install instructions for Centos from EPEL

  1. https://wiki.postgresql.org/wiki/YUM_Installation Show archive.org snapshot
  2. change pg_hba.conf local auth from peer to ident
  3. connect to standard db

MySQL Java Connector

  1. download dev.mysql.com/downloads/connector/j/‎
  2. find it on public ftp to avoid account login on oracle site
  3. cp mysql-connector-java-5.1.29-bin.jar /opt/pentaho/data-integration/lib/

SETUP business integration community edition

  1. download bi-server-ce http://sourceforge.net/projects/pentaho/files/Business%20Intelligence%20Server/5.0.1-stable/
  2. unpack to /opt/
  3. change owner to pentaho:pentaho
  4. prepare database
  5. run scripts like so:
    psql -U postgres < /opt/pentaho/biserver-ce/data/postgresql/create_*
  6. turn off demo db HSQLDB
  • http://interestingittips.wordpress.com/2011/05/05/complete-pentaho-installation-on-ubuntu-part-2/
  • http://anonymousbi.wordpress.com/2013/12/15/pentaho-bi-server-5-0-1ce-mysql-installation-guide/
  1. in tomcat/webapps/pentaho/WEB-INF/web.xml comment out those parts
    [BEGIN HSQLDB DATABASES] 
    <context-param>
    <param-name>hsqldb-databases</param-name>
    <param-value>sampledata@../../data/hsqldb/sampledata,hibernate@../../data/hsqldb/hibernate,quartz@../../data/hsqldb/quartz</param-value>
    </context-param>
    [END HSQLDB DATABASES] 
    [BEGIN HSQLDB STARTER]
    <listener>
    <listener-class>org.pentaho.platform.web.http.context.HsqldbStartupListener</listener-class>
    </listener>
    [END HSQLDB STARTER] 
  1. add repository configuration as described in "Install BA Server with Your Own BA Repository"
    NB: this manual references a lot of stuff from the enterprise edition (i.e. the quartz db is called di_quartz, but should be quartz only)
  2. solve all Java error trace stack problems
    NB: again, make sure there are no funny community ed. / enterprise ed. confusions, also check postgresql authorization configuration
    • ./start-pentaho.sh

Asides

  • Case sensitive Bug checks for table "QRTZ"
  • ERROR [EmbeddedQuartzSystemListener] EmbeddedQuartzSystemListener.ERROR_0007_SQLERROR
  • org.postgresql.util.PSQLException: ERROR: syntax error at or near "RUNSCRIPT"
  • Service conf on CentOS
    • add to chkconfig for start on boot
    • chkconfig --add service
    • chkconfig --level 234 service on
Toolbox Cash on Go
Last edit
Toolbox Cash on Go
Keywords
ops
Tags
Posted by Toolbox Cash on Go to Toolbox Cash on Go's deck (2015-06-29 12:45)