PERFORM END-TO-END DATA ANALYSIS IN THE CLOUD - BUILDING AN IOT ECOSYSTEM WITH ARDUINO AND BLUEMIX - MEETUP

Page created by Reginald Zimmerman
 
Dale Mumper
Open Source Analytics Solution Engineer - Industrial
dale.mumper@ibm.com

Perform End-to-End Data Analysis in the Cloud
Building an IoT Ecosystem with Arduino and Bluemix

                                                       © 2016 IBM Corporation
Disclaimer

    © Copyright IBM Corporation 2016. All rights reserved.
    U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

    THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL PURPOSES ONLY. WHILE EFFORTS WERE
    MADE TO VERIFY THE COMPLETENESS AND ACCURACY OF THE INFORMATION CONTAINED IN THIS PRESENTATION, IT IS PROVIDED
    “AS IS” WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED. IN ADDITION, THIS INFORMATION IS BASED ON IBM'S CURRENT
    PRODUCT PLANS AND STRATEGY, WHICH ARE SUBJECT TO CHANGE BY IBM WITHOUT NOTICE. IBM SHALL NOT BE RESPONSIBLE
    FOR ANY DAMAGES ARISING OUT OF THE USE OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER
    DOCUMENTATION. NOTHING CONTAINED IN THIS PRESENTATION IS INTENDED TO, NOR SHALL HAVE THE EFFECT OF, CREATING ANY
    WARRANTIES OR REPRESENTATIONS FROM IBM (OR ITS SUPPLIERS OR LICENSORS), OR ALTERING THE TERMS AND CONDITIONS OF
    ANY AGREEMENT OR LICENSE GOVERNING THE USE OF IBM PRODUCTS AND/OR SOFTWARE.

    IBM's statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM's sole discretion. Information
    regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision.
    The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or
    functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any
    future features or functionality described for our products remains at our sole discretion.

    IBM, the IBM logo, ibm.com, Information Management, DB2, DB2 Connect, DB2 OLAP Server, pureScale, System Z, Cognos, solidDB, Informix,
    Optim, InfoSphere, and z/OS are trademarks or registered trademarks of International Business Machines Corporation in the United States, other
    countries, or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol (® or
    ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was published. Such trademarks
    may also be registered or common law trademarks in other countries. A current list of IBM trademarks is available on the Web at “Copyright and
    trademark information” at www.ibm.com/legal/copytrade.shtml

    Other company, product, or service names may be trademarks or service marks of others.

Agenda

     Bio

     Solution Overview

     Bluemix Overview

     Sensor Board

     NodeRED

     Cloudant

     dashDB

     Data Science Experience

     Watson Analytics
Bio

     Dale Mumper

     IBM Open Source Analytics Solution Engineer

     Consultant and analytics leader for over 20 years

     Background in physics and math

     Certifications
      - Cloudera Certified Administrator
        for Apache Hadoop - CCAH
      - Cloudera Certified Developer
        for Apache Hadoop - CCDH
      - Microsoft MCSE – Data Platform
      - Microsoft MCSE – Business Intelligence
      - Oracle Certified Professional - OCP

IoT Defined

 “The network of physical devices, vehicles, buildings and other items
embedded with electronics, software, sensors, actuators and network
 connectivity that enables these objects to collect and exchange data.”

              “The infrastructure of the information society.”

    “Every object, device and every familiar part of the traditional home,
                   is being equipped with smart circuitry.”

“With a trillion sensors embedded in the environment—all connected
by computing systems, software and services—it will be possible to
hear the heartbeat of the Earth, impacting human interaction with the
       globe as profoundly as the Internet has revolutionized
                          communications,”

IoT Market Drivers
 USD 157.05 Billion in 2016
 USD 661.74 Billion by 2021
     Compound Annual Growth Rate (CAGR) of 33%
     Impacting all industries
     Industry leaders admit they lack a “clear perspective” on the business opportunities
      afforded in the IoT arena – the trend remains nascent
     2020 could see 30 Billion devices on the global net
 Supplier Attention – open source software and open source hardware,
  development tool kits, major vendor support
 Technological Advances – ARM Cortex (1/10 the power usage), miniaturized
  sensors, declining component costs, faster bandwidth
 Increasing Demand - demand for 1st gen. will increase as costs decline and
  next generations become more advanced; very price sensitive
 Emerging Standards – semiconductor, hardware, networking and software
  companies have joined with a number of industry associations and
  academic consortiums; common APIs
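
The growth rate quoted above can be checked from the two market figures. This is a minimal sketch of the standard compound annual growth rate formula; the function name `cagr` is illustrative and not part of the original deck.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate as a fraction (0.33 means 33%)."""
    return (end_value / start_value) ** (1 / years) - 1


# Market size figures from the slide, in USD billions.
rate = cagr(157.05, 661.74, 2021 - 2016)
print(f"CAGR 2016-2021: {rate:.1%}")
```

Rounded to a whole percent, the result agrees with the 33% stated on the slide.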

Phone Sensor Demo

 Step 1
  • Take out your phone
  • Go to the URL on the card
  • Write down the Device ID
     d:quickstart:phonesensor

 Step 2
  • ibm.biz/iotqstart
  • Enter Device ID

 Step 3
  • Explore
  • Move Phone
     Tilt
     Rotate
     Slow vs. Fast
Environmental Recorder – ER1
   Indoor Environmental Monitoring
    •   Measures and sends data
        Temperature (from three different sensors)
        Humidity
        Air Pressure
        Light Levels
        LEDs provide operational feedback
    • Connects to a local Wi-Fi network
    Synchronizes time from an NTP source
    Gets the real IP address and determines geolocation from IP address
    Asks nearest weather station for local forecast
    Connects to an MQTT broker and sends data

Use Case for the ER1

   Sleep Therapy
   Room Monitoring
   Remote Property
   Easily add sensors and capabilities
    •   UV and IR Sensor
    •   Distance (Ultrasonics and Laser)
    •   Motion
    •   Shock
    •   Vibration
    •   Rotation
    •   Tension and Flex
    •   Soil and Moisture
    •   GPS Module
    •   LTE Cellular, Wi-Fi
    •   Solar Power and Battery
Bill of Materials

 Arduino MKR1000
     • Atmel ATSAMW25 SoC
         SAMD21 Cortex M0+ ARM MCU
         WINC1500 2.4GHz 802.11 b/g/n Wi-Fi
         3.3V
         256KB Flash
         32KB SRAM
         Full-Speed USB w/Embedded Host
 Sensors
     •   Adafruit DS3231
     •   Adafruit SHT31-D
     •   Adafruit TSL2561
     •   Adafruit BMP183
     •   Adafruit NeoPixels
 Parts
     •   LED
     •   220ohm resistor
     •   Full-sized breadboard
     •   USB A/MicroB Cable
     •   Jumper Wires, 3”, MM
     •   Jumper Wires, 6”, MM
 Vendors
     •   adafruit.com
     •   arduino.cc
     •   element14.com
     •   digikey.com

IoT Analytics Ecosystem
IoT + Runtime + Cloudant + dashDB + Spark

[Architecture diagram; labels: MQTT, IoT Platform, Schema Discovery, IBM dashDB, REST (HTTP/s) API, Spark Connector]

Arduino MKR1000
Combines the Arduino Zero and a Wi-Fi Shield at a Great Price Point

 Atmel SAMD21 Cortex-M0+
     •   3.3V
     •   256KB Flash
     •   32KB SRAM
     •   Clock Speed 48MHz
 8 Digital I/O Pins
     • 4 with PWM (pulse width modulated)
    6 Analog Input Pins
    1 Analog Output Pin
    USB connection
    Reset button
    Wi-Fi
    Encryption
    Li-Po Battery Charger

     1.   MCU and Memory
     2.   Wi-Fi
     3.   Small Form Factor
     4.   Lower Cost
SHT31-D Sensor
    Sensor made by Sensirion
     •    2.5 x 2.5 x 0.9 mm3
     •    temperature range of –40°C to 90°C
     •    ±2% relative humidity and ±0.3°C accuracy
    PCB Board made by Adafruit
     •    3V and 5V compliant
     •    I2C interface
    Power Pins
     •    Vin
           2.5 to 5VDC (Volts Direct Current)
     •    GND
           Common Ground
    I2C Logic Pins
     •    SCL
           I2C clock
     •    SDA
           I2C data pin
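
The sensor returns 16-bit raw values over I2C that must be scaled into physical units. The sketch below uses the conversion formulas published in the Sensirion SHT3x datasheet; the function names are illustrative, and whether the ER1 sketch performs the conversion itself or relies on the Adafruit library is an assumption not stated in the deck.

```python
def sht31_temperature_c(raw: int) -> float:
    """Convert a 16-bit raw temperature word to degrees Celsius."""
    return -45.0 + 175.0 * raw / 65535.0


def sht31_humidity_pct(raw: int) -> float:
    """Convert a 16-bit raw humidity word to percent relative humidity."""
    return 100.0 * raw / 65535.0


def celsius_to_fahrenheit(celsius: float) -> float:
    """Convert degrees Celsius to degrees Fahrenheit."""
    return celsius * 9.0 / 5.0 + 32.0


# Mid-scale raw words, chosen only for illustration.
raw_temperature, raw_humidity = 0x6666, 0x8000
temp_c = sht31_temperature_c(raw_temperature)
print(f"{temp_c:.2f} C, {celsius_to_fahrenheit(temp_c):.2f} F, "
      f"{sht31_humidity_pct(raw_humidity):.2f} %RH")
```

The raw words here are made-up inputs, not readings from the workshop hardware.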

TSL2561 Sensor

 Sensor made by ams AG
  • Light-to-digital converter
  • 188ulux to 88,000lux
  • Infrared and Full Spectrum diodes
 PCB Board made by Adafruit
     •    3V and 5V compliant
     •    I2C interface
    Power Pins
     •    Vin
           2.5 to 5VDC (Volts Direct Current)
     •    GND
           Common Ground
    I2C Logic Pins
     •    SCL
           I2C clock
     •    SDA
           I2C data pin
Adafruit DS3231 Real-Time Clock (RTC)

 Chip made by Maxim Integrated
     • DS3231 Real-Time Clock (RTC)
     • Temperature-compensated crystal oscillator and crystal
     • Long-term accuracy
 PCB Board made by Adafruit
  • I2C interface
  • Optional battery maintains time
 Power Pins
     •   Vin
     •   GND
 I2C Logic Pins
     •   SCL - I2C clock
     •   SDA - I2C data pin

BMP183 Sensor

 Sensor made by Bosch
  • 300 to 1100hPa (+9000m to -500m)
  • Enhanced GPS, navigation, weather, vert. velocity
 PCB Board made by Adafruit
     •   3V and 5V compliant
     •   SPI interface
    Power Pins
     •   Vin
         2.5 to 5VDC (Volts Direct Current)
     •   GND
         Common Ground
 SPI Logic Pins
     •   SCK - Clock
     •   SDO - Serial Data OUT
     •   SDI - Serial Data IN
     •   CS - Chip Select
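
The pressure range above maps to altitude through the international barometric formula given in the Bosch BMP-series datasheets. The following is a minimal sketch of that formula; the function name and the default sea-level pressure of 1013.25 hPa are assumptions for illustration, and the deck does not say how the ER1 sketch computes its altitude fields.

```python
def altitude_m(pressure_hpa: float, sea_level_hpa: float = 1013.25) -> float:
    """Estimate altitude in meters from barometric pressure in hPa."""
    return 44330.0 * (1.0 - (pressure_hpa / sea_level_hpa) ** (1.0 / 5.255))


# The two ends of the sensor's measurement range, plus standard sea level.
for pressure in (300.0, 1013.25, 1100.0):
    print(f"{pressure:8.2f} hPa -> {altitude_m(pressure):9.1f} m")
```

Passing a locally reported sea-level pressure as `sea_level_hpa` gives a better estimate than the standard-atmosphere default.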
NeoPixels == TOTALLY COOL

    Ring
    Jewel
    Strips
    Stick
    Matrix

Turning Sensors into an IoT Device (ER1)

    Sensors, Clock and LEDs in Review
    Wi-Fi Connectivity
    NTP Client
    Time and Data Handling
    C/C++ Style Floating Point Operations
    HTTP Client
    MQTT Client
    JSON Parsing
    ER1 Sketch Version 3.50
     • Expects to find the IBM_CLASS 2.4GHz, WPA wireless network
        Already has the SSID and the password in the sketch
     • Defaults to using the IBM Watson IoT Platform in Quickstart Mode
     • Sketch automatically determines the Device ID from the MAC
        See your laminated MKR1000 card in your student kit
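
As a rough illustration of deriving a Device ID from the MAC address, the sketch below lowercases the MAC and strips separators, which matches the `mac_addr` and `deviceId` values shown in the sample message later in this deck. The function names are hypothetical, and the `d:quickstart:<type>:<id>` client ID layout is an assumption based on the Watson IoT Platform convention, not taken from the ER1 sketch itself.

```python
def device_id_from_mac(mac: str) -> str:
    """Return a 12-character lowercase hex ID from a MAC address string."""
    hex_digits = mac.replace(":", "").replace("-", "").lower()
    if len(hex_digits) != 12 or any(c not in "0123456789abcdef" for c in hex_digits):
        raise ValueError(f"not a 48-bit MAC address: {mac!r}")
    return hex_digits


def quickstart_client_id(device_type: str, device_id: str) -> str:
    """Build an MQTT client ID for Quickstart mode (assumed layout)."""
    return f"d:quickstart:{device_type}:{device_id}"


device_id = device_id_from_mac("F8:F0:05:F5:F8:DB")
print(quickstart_client_id("MKR1000", device_id))
```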

Cloud Service Models
 IaaS
     •   Virtual Servers
     •   Bare Metal Servers
     •   Network
     •   Storage
     •   Load Balancers
 PaaS
     •   Database
     •   Web Server
     •   Development Tools
     •   Runtime Containers
 SaaS
     •   eMail
     •   CRM
     •   Games
     •   Virtual Desktop
Who Does What?

     On-Premise                  IaaS               PaaS                SaaS

      Applications            Applications       Applications        Applications

          Data                    Data               Data                Data

        Runtime                 Runtime            Runtime             Runtime

      Middleware               Middleware        Middleware          Middleware

           OS                      OS                 OS                  OS

      Virtualization          Virtualization     Virtualization      Virtualization

        Servers                 Compute            Compute             Compute

        Storage                 Storage            Storage             Storage

      Networking               Networking        Networking          Networking

                       Managed by Client       Managed by Provider
IBM Cloud

IBM Bluemix

 Bluemix is an open-standard, cloud-based platform for building, managing, and
  running applications of all types (web, mobile, big data, new smart devices…)

     Go Live in Seconds               APIs and Services                  On-Premises Integration
     Zero to running in one click.    A catalog of IBM, third party,     Build hybrid environments.
     Development plans deploy in      and open source API services       Connect to on-premises
     seconds. Enterprise plans        allow the developer to stitch an   assets plus other public and
     deploy in 1-2 days.              application together in minutes.   private clouds.

     DevOps                           Layered Security                   Flexible Pricing
     Development, monitoring,         IBM secures the platform and       Sign up in minutes. Pay as
     deployment, and logging tools    infrastructure and provides        you go and subscription
     allow the developer to run the   you with the tools to secure       models offer choice and
     entire application.              your apps.                         flexibility.

Demo – Bluemix Overview

We Are Here

              MQTT

This Is Our Destination
IoT + Runtime + Cloudant + dashDB + Spark

[Architecture diagram; labels: MQTT, IoT Platform, Schema Discovery, IBM dashDB, REST (HTTP/s) API, Spark Connector]

IBM Watson IoT Starter Platform

1. Catalog > Boilerplates > Internet of Things Platform Starter
2. Fill in Name: 
3. CREATE

 Application is created and staged
     • http://mybluemix.net
     • Creates a Node.js SDK Container
     • Creates a Cloudant NoSQL Database

Node-RED
     A visual tool for wiring the Internet of Things
     •   Browser-based UI for creating flows of events
     •   Deploying action in a light-weight runtime
     •   Based upon node.js
         •       Event-driven, non-blocking model
         •       Flows stored as JSON, so super easy to share
         •       Large library available today
         •       Suitable for server, network, edge and mobile device placement
     •   Open source project on GitHub
     •   IBM is a major contributor
     •   Benefits
         •       Rapid Development
         •       Simple to use with JSON
         •       Simple REST API
         •       Simple MQTT messaging
         •       Contributor Nodes
             •     Simple to use other services
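
Because flows are stored as JSON, they can be inspected with any JSON tool. The sketch below parses a small hand-written flow and lists its wiring; the node IDs and names are made up for illustration, and only the generic `id`, `type`, `name` and `wires` fields of the exported flow format are relied on.

```python
import json

# A tiny hand-written flow: one input node wired to two downstream nodes.
flow_json = """
[
  {"id": "n1", "type": "mqtt in",  "name": "ER1 events",  "wires": [["n2", "n3"]]},
  {"id": "n2", "type": "function", "name": "add fields",  "wires": [[]]},
  {"id": "n3", "type": "debug",    "name": "msg.payload", "wires": []}
]
"""

nodes = {node["id"]: node for node in json.loads(flow_json)}

# "wires" holds one list of target node IDs per output port.
edges = [
    (source["name"], nodes[target]["name"])
    for source in nodes.values()
    for port in source["wires"]
    for target in port
]

for source_name, target_name in edges:
    print(f"{source_name} -> {target_name}")
```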

MQTT

 Machine-to-Machine (M2M)/“Internet of Things” (IoT)
     • Lightweight connectivity protocol for publish/subscribe messaging transport
     • Small code footprint, limited bandwidth, low power usage
     • Minimized packets and efficient distribution to multiple receivers
 MQTT v3.1.1 now an OASIS Standard
     • Invented by Dr. Andy Stanford-Clark (IBM) and Arlen Nipper (Eurotech)
     • MQ Telemetry Transport (ISO/IEC PRF 20922)
 MQTT Broker/Servers
     • IBM WebSphere MQ Telemetry, MessageSight, Integration Bus
     • Mosquitto, Eclipse Paho, Eurotech Everyware Device Cloud, emqttd,
       Xively, Moquette, Yunab.io, m2m.io, RabbitMQ, Apache ActiveMQ, HiveMQ
 MQTT Client Methods
     • Connect, Disconnect, Subscribe, Unsubscribe, Publish
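
To make the publish/subscribe idea concrete, here is a toy in-memory broker. It is only a sketch of the pattern: it is not an MQTT implementation, has no networking, QoS or topic wildcards, and the class and method names are invented for illustration.

```python
from collections import defaultdict
from typing import Callable, DefaultDict, List

Handler = Callable[[str, bytes], None]


class ToyBroker:
    """Delivers each published message to every subscriber of that exact topic."""

    def __init__(self) -> None:
        self._subscribers: DefaultDict[str, List[Handler]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Handler) -> None:
        self._subscribers[topic].append(handler)

    def unsubscribe(self, topic: str, handler: Handler) -> None:
        if handler in self._subscribers[topic]:
            self._subscribers[topic].remove(handler)

    def publish(self, topic: str, payload: bytes) -> int:
        """Send payload to all current subscribers; return how many received it."""
        handlers = list(self._subscribers.get(topic, []))
        for handler in handlers:
            handler(topic, payload)
        return len(handlers)


broker = ToyBroker()
dashboard, archive = [], []
broker.subscribe("er1/status", lambda topic, payload: dashboard.append(payload))
broker.subscribe("er1/status", lambda topic, payload: archive.append(payload))

# One publish from the device reaches both receivers.
delivered = broker.publish("er1/status", b'{"SHT31_tempC": 27.72}')
print(delivered, dashboard == archive)
```

The publisher never needs to know who is listening, which is what lets one sensor feed several consumers at once.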

msg.payload
{
  "topic": "iot-2/type/MKR1000/id/f8f005f5f8db/evt/status/fmt/json",
  "payload": {
    "d": {
      "IBM_IoT_Workshop": "Arduino_MKR1000",
      "recordType": "sensorsRead",
      "DS3231_epoch": 1471003668,
      "DS3231_date": "08-13-2016",
      "DS3231_time": "13:07:48",
      "DS3231_tempC": 28,
      "DS3231_tempF": 82.4,
      "SHT31_tempC": 27.72,
      "SHT31_tempF": 81.94,
      "SHT31_humidity": 45.32,
      "TSL2561_lux": 9,
      "BMP183_hPa": 1004.22,
      "BMP183_tempC": 28.08,
      "BMP183_tempF": 82.55,
      "BMP183_altStatic": 78.98,
      "BMP183_altComputed": 68.09,
      "local_IP": "192.168.0.170",
      "mac_addr": "f8f005f5f8db"
    }
  },
  "deviceId": "f8f005f5f8db",
  "deviceType": "MKR1000",
  "eventType": "status",
  "format": "json",
  "_msgid": "4a43bc63.b5bc44"
}
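
The `deviceType`, `deviceId`, `eventType` and `format` fields in the message mirror the segments of the topic string. A minimal sketch of splitting such a topic follows; the function name is illustrative, and the segment layout is inferred from the sample message above.

```python
def parse_event_topic(topic: str) -> dict:
    """Split an 'iot-2/type/.../id/.../evt/.../fmt/...' topic into named fields."""
    parts = topic.split("/")
    if (
        len(parts) != 9
        or parts[0] != "iot-2"
        or parts[1::2] != ["type", "id", "evt", "fmt"]
    ):
        raise ValueError(f"unexpected topic layout: {topic!r}")
    device_type, device_id, event_type, fmt = parts[2::2]
    return {
        "deviceType": device_type,
        "deviceId": device_id,
        "eventType": event_type,
        "format": fmt,
    }


fields = parse_event_topic("iot-2/type/MKR1000/id/f8f005f5f8db/evt/status/fmt/json")
print(fields)
```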

Demo – Node-RED

ER1 Message Payloads

    deviceStart
    ipapiFetch
    localWeather
    sensorRead
    badJSON

 These are all placed into one NoSQL database
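
Since every record type lands in the same database, each document needs a field that tells them apart. The sketch below assumes that field is `recordType` (as in the sample message) and that `badJSON` marks messages that fail to parse; both the function name and that reading of `badJSON` are assumptions, not taken from the ER1 flow.

```python
import json
from collections import Counter


def to_document(raw: str) -> dict:
    """Parse a raw message; wrap anything unparseable as a badJSON record."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return {"recordType": "badJSON", "raw": raw}
    if not isinstance(data, dict) or "recordType" not in data:
        return {"recordType": "badJSON", "raw": raw}
    return data


messages = [
    '{"recordType": "deviceStart"}',
    '{"recordType": "sensorRead", "SHT31_tempC": 27.72}',
    '{"recordType": "sensorRead", ',  # truncated on purpose
]

# All documents go to one store; recordType lets later queries separate them.
counts = Counter(to_document(message)["recordType"] for message in messages)
print(dict(counts))
```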

deviceStart

ipapiFetch

localWeather

sensorRead

Cloudant – NoSQL Database as a Service
     Cloudant delivers a fully-managed database in service to the Analytics, App, and API economy

 Powerful DBaaS: a fully-managed NoSQL database layer that can be developed & deployed in days
 Spark Integration (Spark SQL)
 dashDB Integration (Analytics)
 Operational NoSQL JSON store
 Master-less architecture for maximum scalability & availability
 Advanced APIs
     •   REST (HTTPS) API
     •   Replication & synchronization
     •   Geo-load balancing
     •   Incremental MapReduce indexes
     •   Military-grade Geospatial indexes
     •   Lucene full-text search
 Offline access to mobile apps & data

Demo – Cloudant

Edge to Warehouse

 Cloudant sits on the Edge of Cloud
     •   Fast, minimal latency, scalable
     •   Transactional
     •   Not the place for long-term storage
     •   Not the place for analytics
 Move IoT data to a warehouse
     • Basic business intelligence
     • Connect to other sources of data
     • The start of analytics journey
 dashDB on Bluemix
     • Data Warehouse as a Service

IBM dashDB – Analytics Warehouse as a Service
     In-database analytics capabilities for best performance atop a fully-managed warehouse

 BLU Acceleration
 Netezza In-Database Analytics
 Cloudant NoSQL Integration
 dashDB MPP

 For analytics:
     •   Fully-managed data warehouse on cloud
     •   Choice of SoftLayer or Amazon Web Services
     •   BLU Acceleration columnar technology + Netezza in-database analytics
     •   BLU in-memory processing, data skipping, actionable compression, parallel vector processing, “Load & Go” administration
     •   Netezza predictive analytic algorithms
     •   Fully integrated RStudio & R language
     •   Oracle compatibility
     •   Massively Parallel Processing (MPP)
     •   On disk data encryption and secure connectivity
 For apps that need:
     •   Elastic scalability
     •   High availability
     •   Data model flexibility
     •   Data mobility
     •   Text search
     •   Geospatial
 Available as:
     •   Fully managed DBaaS
     •   On-premises private cloud
     •   Hybrid architecture
Demo – dashDB

Replicating Cloudant JSON Data into dashDB

 Cloudant’s Schema Discovery Process (SDP) translates JSON documents into
  a schema (or set of tables) that dashDB understands

                                                  SDP maintains continuous
                                                  synchronization from
                                                  Cloudant to dashDB
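
To give a feel for what translating JSON documents into a table schema involves, here is a simplified sketch that flattens nested keys into column names and infers a column type from the values seen. It is not Cloudant's actual Schema Discovery algorithm; the function names, the underscore naming rule and the SQL type names are all assumptions for illustration.

```python
def flatten(doc: dict, prefix: str = "") -> dict:
    """Flatten nested objects into a single level, joining keys with '_'."""
    columns = {}
    for key, value in doc.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            columns.update(flatten(value, prefix=f"{name}_"))
        else:
            columns[name] = value
    return columns


def sql_type(value) -> str:
    """Map a JSON value to an illustrative SQL type name."""
    if isinstance(value, bool):
        return "BOOLEAN"
    if isinstance(value, int):
        return "BIGINT"
    if isinstance(value, float):
        return "DOUBLE"
    return "VARCHAR"


def discover_schema(docs: list) -> dict:
    """Infer one type per column, widening when documents disagree."""
    schema = {}
    for doc in docs:
        for column, value in flatten(doc).items():
            new_type = sql_type(value)
            old_type = schema.get(column, new_type)
            if {old_type, new_type} == {"BIGINT", "DOUBLE"}:
                schema[column] = "DOUBLE"      # mixed integers and decimals
            elif old_type != new_type:
                schema[column] = "VARCHAR"     # fall back to text
            else:
                schema[column] = new_type
    return schema


docs = [
    {"deviceId": "f8f005f5f8db", "d": {"DS3231_tempC": 28, "SHT31_tempC": 27.72}},
    {"deviceId": "f8f005f5f8db", "d": {"DS3231_tempC": 28.5}},
]
print(discover_schema(docs))
```

Scanning many documents before fixing a type matters for IoT data, where a reading such as 28 may arrive as an integer in one message and as 28.5 in the next.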

Demo – Replication and SQL

Tailored Experiences For Users Collaborating Together
[Cycle diagram]
     INPUT: Understand problem and domain; Ingest data
     ANALYSIS: Explore and understand data; Transform: clean; Transform: shape; Create and build model; Evaluate
     OUTPUT: Deliver and deploy model; Communicate results

 Data Engineer
     •   Architects how data is organized & ensures operability
     •   Data Connect
 Data Scientist
     •   Gets deep into the data to draw hidden insights for the business
     •   Data Science Experience
 Business Analyst
     •   Works with data to apply insights to the business strategy
     •   Watson Analytics
 App Developer
     •   Plugs into data and models & writes code to build apps
     •   Bluemix
What is a “Notebook”?
 Pen and Paper
     Pen and paper has long provided the rich experience that scientists need to
     document progress through notes and drawings:
     –   Expressive
     –   Cumulative
     –   Collaborative
 Notebooks
     Notebooks are the digital equivalent of the “pen and paper” lab notebook,
     enabling data scientists to document reproducible analysis:
     –   Markdown and visualization
     –   Iterative exploration
     –   Easy to share

Web-Based Notebooks…
 Notebooks:
  “interactive computational environment, in which you can combine
      code execution, rich text, mathematics, plots and rich media”

      Jupyter
       • Based on IPython
       • Supports multiple interpreters
              • Python, Scala, R
      Zeppelin
       • Apache incubator project
       • Supports multiple interpreters
              • Python, Scala, others

            Data Scientist
                  &
             Notebooks

Introducing the Data Science Experience - DSX
Currently in Public Beta

              Learn                    Create                      Collaborate
       Built-in learning to     The best of open source          Community and
       get started or go        and IBM value-add to             social features that
       the distance with        create state-of-the-art          provide meaningful
       advanced tutorials       data products                    collaboration

                                                    Powered by

                              http://datascience.ibm.com

Core Attributes of the Data Science Experience

                                                                     IBM Data Science Experience

 Community
     • Find tutorials and datasets
     • Connect with Data Scientists
     • Ask questions
     • Read articles and papers
     • Fork and share projects
 Open Source
     • Code in Scala/Python/R/SQL
     • Jupyter and Zeppelin* Notebooks
     • RStudio IDE and Shiny apps
     • Apache Spark
     • Your favorite libraries
 IBM Added Value
     • Data Shaping/Pipeline UI*
     • Auto-data preparation and modeling*
     • Advanced Visualizations*
     • Model management and deployment*
     • Documented Model APIs*
     • Spark as a Service

      Powered by IBM DataWorks in the Cloud

      * DSX product roadmap items
Demo – Data Science Experience

IBM Watson Analytics - Smart Data Discovery in the Cloud

All the benefits of advanced analytics without the complexity

        Data prep made easy                                    Understand outcomes

          Guided exploration                                        Share insights

      Designed to support the business professional’s analytics process so it’s easy to engage
      with and find meanings and patterns in your data in minutes.
Demo – Watson Analytics

IBM investment into Apache Spark

 Foster Community
     •   Educate 1M+ data scientists and engineers via online courses
     •   Sponsor AMPLab, creators and evangelists of Spark
 Contribute to the Core
     •   Launch Spark Technology Center (STC), 300 engineers
     •   Open source SystemML
     •   Partner with Databricks
 Infuse the Portfolio
     •   Integrate Spark throughout portfolio
     •   3,500 employees working on Spark-related topics
     •   Spark however customers want it – standalone, platform or products

     "It's like Spark just got blessed by the enterprise rabbi."
     Ben Horowitz, Andreessen Horowitz

     Source: https://www-03.ibm.com/press/us/en/pressrelease/47107.wss
IBM Contributes to core Apache Spark Project

                                  www.spark.tc
     IBM has the largest investment in Spark of any company in the world
                                                      IBM Spark Technology Center

                                                     • Launched in June of 2015

                                                     • Goal to hire 300 Engineers.

                                                     • Goal to contribute to the Apache
                                                       Spark community

                                                     • Contributed SystemML
                                                       technology to Apache community

                                                     • STC continues to grow...
      IBM Spark

IBM driving SQL and Machine Learning innovation..

http://www.spark.tc/blog/
Foster Community - Free Education
     Big Data University
     http://bigdatauniversity.com/

Sign up to learn more!

             http://ibm.biz/datagurus

     •   Webinars
     •   Meetups
     •   Hands-on Labs
     •   Learning Resources
     •   Email
     •   Twitter: @data_gurus
Raffle!
        Fill out the paper form
        and drop it in the box.

     Two books being given away!

 Dale Mumper
 Open Source Analytics Solution Engineer - Industrial
 dale.mumper@ibm.com