Big Data BI in the cloud
Big data solutions are marching into the BI space in a big way. The room for traditional data warehouse solutions will most likely shrink, and big data solutions with web-based front-ends will take market share.
This is a high-level overview of some alternatives.
List of solutions
The open source Hadoop stack dominates this space. These solutions are not very end-user friendly, but a number of products, often based on Hadoop, are appearing that address that issue.
- SiSense - web front-end with an in-memory back-end; needs to be installed on a local server, cloud options are coming
- KarmaSphere - appliance that needs to be installed
- Datameer
- Platfora - cloud and on-premise solution available
My plan is to find some time in the coming week to try these out.
More solutions to check out:
- http://www.qubole.com/
- http://www.hapyrus.com/
I've tried to sign up for a cloud-based trial but haven't received any mail. I've contacted support about this.
- Download - http://download.datameer.com/trial/Datameer-2.1/Datameer-220.127.116.11/Datameer-18.104.22.168.dmg
The personal edition seems to run OK on my Mac. The web-based interface is intuitive and easy to use. JDBC drivers need to be set up in order to access relational databases.
An interesting feature is that there is an app market with apps for SalesForce, Zendesk, Web site stats etc.
Installation Guide - http://support.karmasphere.com/customer/portal/articles/792010-trial-vm-configuration?mkt_tok=3RkMMJWWfF9wsRonvK%2FJZKXonjHpfsX56ewrUaW%2FlMI%2F0ER3fOvrPUfGjI4DTcNqI%2BSLDwEYGJlv6SgFSLnBMbhrwLgEWRY%3D
Resource center - http://support.karmasphere.com/customer/portal/articles/768615-karmasphere-resource-center?mkt_tok=3RkMMJWWfF9wsRonvK%2FJZKXonjHpfsX56ewrUaW%2FlMI%2F0ER3fOvrPUfGjI4DTcNqI%2BSLDwEYGJlv6SgFSLnBMbhrwLgEWRY%3D
Start a medium-sized Ubuntu 12.04 instance and log in.
Install Ubuntu desktop. I'm using a NoMachine client to log in (VNC etc. can also be used).
# This cfengine script installs what is needed
sudo apt-get install -y cfengine3 git
cd /etc/cfengine3
sudo git clone git://github.com/colmsjo/ubuntu_desktop.git .
sudo chmod -R og=r /etc/cfengine3/roles
sudo nano /etc/default/cfengine3
sudo /etc/init.d/cfengine3 restart
sudo sh -c "cf-agent --verbose --define phonegap > cfengine-log.txt" && more cfengine-log.txt
# This needs to be fixed manually, the cfengine script doesn't do it
sudo nano /etc/ssh/sshd_config
...
PasswordAuthentication yes
# Reboot in order to restart all processes, since the configuration of sshd etc. has changed
sudo reboot
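The manual edit of sshd_config in the step above could also be scripted. This is only a minimal sketch, not something the cfengine script does: the function name enable_password_auth is made up for illustration, and it assumes the file either already has a (possibly commented-out) PasswordAuthentication line or has none at all.

```shell
#!/bin/sh
# Hypothetical helper: make sure "PasswordAuthentication yes" is set
# in the sshd config file given as the first argument.
enable_password_auth() {
    conf="$1"
    pattern='^[[:space:]]*#?[[:space:]]*PasswordAuthentication[[:space:]].*'
    if grep -Eq "$pattern" "$conf"; then
        # Rewrite the existing (possibly commented-out) directive.
        tmp=$(mktemp)
        sed -E "s/$pattern/PasswordAuthentication yes/" "$conf" > "$tmp"
        # Write back with cat so the original file keeps its permissions.
        cat "$tmp" > "$conf"
        rm -f "$tmp"
    else
        # No directive present: append one.
        echo 'PasswordAuthentication yes' >> "$conf"
    fi
}

# Usage (as root, assumed path):
# enable_password_auth /etc/ssh/sshd_config
```

sshd still has to be restarted (or the machine rebooted, as above) before the change takes effect.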
The password for the ubuntu user needs to be set in order to log in with NX
sudo passwd ubuntu
Download Oracle Java 6: http://www.oracle.com/technetwork/java/javase/downloads/jre6downloads-1902815.html
# I'm using an S3 bucket to upload the Java install file
sudo apt-get install s3cmd
s3cmd --configure
s3cmd get ...
chmod +x jre-6u45-linux-x64.bin
./jre-6u45-linux-x64.bin
mv jre1.6.0_45 java-6-oracle
sudo mkdir /usr/lib/jvm
sudo mv java-6-oracle /usr/lib/jvm
# Update path etc.
nano .bashrc
export PATH=$PATH:/usr/lib/jvm/java-6-oracle/bin
export JAVA_HOME=/usr/lib/jvm/java-6-oracle
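Instead of editing .bashrc by hand, the two export lines could be appended from a script. A minimal sketch, assuming the install path used above; append_once is a hypothetical helper name, and it only adds a line if the exact same line is not already in the file, so it is safe to run more than once.

```shell
#!/bin/sh
# Hypothetical helper: append a line to a file only if it is not already there.
append_once() {
    file="$1"
    line="$2"
    touch "$file"
    # -x: match the whole line, -F: treat the line as a fixed string
    grep -qxF -e "$line" "$file" || printf '%s\n' "$line" >> "$file"
}

# Usage (assumed paths, matching the steps above):
# append_once "$HOME/.bashrc" 'export PATH=$PATH:/usr/lib/jvm/java-6-oracle/bin'
# append_once "$HOME/.bashrc" 'export JAVA_HOME=/usr/lib/jvm/java-6-oracle'
```

After opening a new shell, running java -version should show whether the JRE is found on the path.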
Connect with NoMachine
s3cmd get s3://gizur-tmp/ksanalyst_linux_emr.sh
chmod +x ksanalyst_linux_emr.sh
./ksanalyst_linux_emr.sh