Course description: BigInsights Analytics for Programmers - SPVC

Overview

  • Course code: 2W651
  • Skill level: Intermediate
  • Duration: 16.0 hours
  • Delivery type: Self-paced Virtual Class
  • Course type: Public only
  • Public price: USD $1,200.00 plus tax

This is a self-paced online course. The average amount of time to complete this course is 16 hours. Once you receive your access information, you will have 30 days to complete this course.

This is an online course. Please do not make travel arrangements for this course. After you receive confirmation that you are enrolled, you will be sent further instructions.

This course is designed for programmers who work with IBM InfoSphere BigInsights. Writing programs that extract data from unstructured text can be a daunting task. You will learn how to create annotators using IBM's Annotation Query Language (AQL). Analyzing data with Apache Hadoop requires writing Map / Reduce programs. You will learn how to use Jaql to write high-level programs that are decomposed into Hadoop Map / Reduce jobs. People familiar with Hadoop know that other open source products are also used in this environment. This course gives you an overview of Apache Pig, ZooKeeper, and Map / Reduce.

This course does not cover InfoSphere BigSheets. InfoSphere BigSheets is covered in IBM InfoSphere BigInsights Analytics for Business Analysis (DW640).

The self-paced format lets you complete the course at your convenience, at any location, and at your own pace. The course is available 24 hours a day. Once you have accessed the course, instructor help is available through the course forum Monday through Friday; questions receive a response within 24 hours.

This is the self-paced version of the classroom course BigInsights Analytics for Programmers (DW651), the instructor-led online course BigInsights Analytics for Programmers (ILO) (3W651), and the web-based course BigInsights Analytics for Programmers - WBT (1W651).

Special note

IBM Education Advantage Program Eligibility:

  • Yes - Education Pack - online account

Audience

This intermediate course is for anyone who needs to learn AQL and Jaql.

Prerequisites

You should have:

  • completed InfoSphere BigInsights Essentials, or have equivalent knowledge

A programming background is an advantage, especially knowledge of SQL.

Skills taught

  • Describe the AQL data model
  • Use AQL to create annotators
  • List the different extractors used to create AQL views
  • Describe how Jaql is used in a BigInsights environment
  • List the semantics of the Jaql language
  • Explain how to use SQL in Jaql
  • List the Jaql core operators
  • Describe how Apache Pig can be used in a BigInsights environment
  • Identify the semantics of Pig Latin
  • Explain how ZooKeeper can be used to manage barriers or queues
  • Describe the basics of Map / Reduce programming
  • Compare HBase to a Relational Database Management System

Course outline

  • An Introduction to Programming for BigInsights
  • Annotation Query Language
  • Jaql
  • An Introduction to Apache Pig

Agenda:

Day 1

  • Unit 1 - Introduction to Programming for BigInsights
  • Unit 2 - Annotation Query Language
    • Exercise 1 - Create Views using Regular Expressions, Dictionaries, and Splits
  • Unit 2 - Annotation Query Language (cont.)
    • Exercise 2 - Create Views using Part of Speech, Blocks, and Patterns
  • Unit 2 - Annotation Query Language (cont.)
    • Exercise 3 - Create Views using Select Statements and Tables
  • Unit 2 - Annotation Query Language (cont.)
    • Exercise 4 - Develop an AQL Extraction Application

Day 2

  • Exercise 4 - Develop an AQL Extraction Application (continued)
  • Unit 3 - Jaql
    • Exercise 5 - Using Jaql
  • Unit 4 - An Introduction to Apache Pig

Machine requirements

HW/SW CONFIGURATION

The minimum hardware and software required to launch the course are:

  • Reliable HIGH-SPEED INTERNET connection (min 200 kbps up and down)
  • Windows 2000 or XP or Vista
  • Computer with soundcard
  • Headset or computer speakers
  • Internet Explorer 5.01 or later, or Firefox 1.0 or later

Network Speed Test

http://clpext.moppssc.com/index.php?option=com_wrapper&view=wrapper&Itemid=8

User: clp

Pass: ibmeduc

For example, in a speed test against the server over a slow connection (140 kbps download, 28 kbps upload), a 30-minute recording took 14 minutes to load before the video began. Extrapolate from this result to estimate how your own internet connection would perform.

High-speed broadband internet access is the recommended configuration for this course.
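
The extrapolation from the speed-test example can be sketched in a few lines of Python. This is only a rough estimate: it assumes that load time scales inversely with download speed, which the course page does not state, and the function name `estimated_load_minutes` is made up for this illustration. The reference figures (140 kbps, 14 minutes) are the ones quoted in the example.

```python
def estimated_load_minutes(download_kbps: float,
                           ref_kbps: float = 140.0,
                           ref_minutes: float = 14.0) -> float:
    """Estimate the load time of the 30-minute recording.

    Assumption (not from the course page): load time is inversely
    proportional to download speed, scaled from the reference test
    of 140 kbps -> 14 minutes.
    """
    if download_kbps <= 0:
        raise ValueError("download_kbps must be positive")
    return ref_minutes * ref_kbps / download_kbps


if __name__ == "__main__":
    # 200 kbps is the stated minimum connection speed for the course.
    for speed in (140, 200, 1000):
        minutes = estimated_load_minutes(speed)
        print(f"{speed} kbps -> about {minutes:.1f} minutes to load")
```

Under this assumption, a connection at the 200 kbps minimum would need roughly 10 minutes to load the same recording, which is why a broadband connection is recommended.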

Keyboard Configuration

If you use a keyboard with a different character layout, you may experience errors when entering passwords. If possible, change the language/country setting for your keyboard to USA, which lets you enter characters as on a QWERTY keyboard.
