These slides provide a great overview of BigML's end-to-end workflow for building advanced predictive models, and also highlights the key new features from BigML's Fall 2013 Release.
Human Factors of XR: Using Human Factors to Design XR Systems
BigML Webcast: September 25, 2013
1. BigML Inc, 2013 September 2013
Machine Learning Made Easy
Wednesday, September 25, 13
2. BigML Inc, 2013 September 2013
Today’s Webinar
• Speakers:
• Andrew Shikiar,VP Business Development
• Poul Petersen, CIO
• Enter questions into chat box – we’ll answer some
via text; others at the end of the session
• For direct follow-up, email us at info@bigml.com
2
Wednesday, September 25, 13
3. BigML Inc 3
1 BigML Workflow
2 Live demos
3 Fall 2013 Release
Wednesday, September 25, 13
5. BigML Inc 5
Local Tools
licenseserver vendor storage
sysadmin
upgrades
patches
license
server
vendor
storageupgrades
patches
license
server vendor
storage
upgrades
patches
license
server
vendor
storageupgrades patchessysadmin
Wednesday, September 25, 13
6. BigML Inc 6
“Any fool can make something complicated. It
takes a genius to make it simple.”
― Woody Guthrie
BigML Workflow
Consistent Interface
Auto Scaling
Machine Learning
Storage
Wednesday, September 25, 13
7. BigML Inc 7
Source
Source Dataset Model Evaluation Prediction
Upload
Drag-n-drop, file upload, remote URL, S3, Odata
Change types
Select data for dataset
Numeric, Categorical, Date-Time, Text
Filter rows, un/select fields, choose size
Wednesday, September 25, 13
8. BigML Inc 8
Dataset
Source Dataset Model Evaluation Prediction
Examine
Drag-n-drop, file upload, remote URL, S3, Odata
Configure Model
Select data for model
Fields and sampling
Choose objective field
Model or ensemble
Wednesday, September 25, 13
9. BigML Inc 9
Model
Source Dataset Model Evaluation Prediction
Examine model
Tree view, Filters, Sunburst
Download Model
Share
Rules, JSON, PMML, Python, Ruby, Objective-C
Java, C#, Node.js
Gallery, Private Links
Wednesday, September 25, 13
10. BigML Inc 10
Evaluation
Source Dataset Model Evaluation Prediction
Evaluate Model
Compare to with-held data or new dataset
Traditional Metrics
Compare Evaluations
Accuracy, Precision, Recall, F1, Phi
MAE, MSR, R2
Wednesday, September 25, 13
11. BigML Inc 11
Prediction
Source Dataset Model Evaluation Prediction
Predict
Question by Question
Ensembles
Aggregate by plurality, confidence, or probability
Input new data, get predicted output and confidence
One input field at a time
Wednesday, September 25, 13
12. BigML Inc 12
1 BigML Workflow
2 Live demos
3 Fall 2013 Release
Wednesday, September 25, 13
13. BigML Inc 13
Churn Demo
Objective: Predict Churn
Customers that closed account
Data
Fictional Telecom Company
Wednesday, September 25, 13
16. BigML Inc 16
1 BigML Workflow
2 Live demos
3 Fall 2013 Release
Wednesday, September 25, 13
17. BigML Inc 17
Inline Field Editing
Source Dataset Model Evaluation Prediction
New
Wednesday, September 25, 13
18. BigML Inc 18
Secret Links
Source Dataset Model Evaluation Prediction
New
Wednesday, September 25, 13
19. BigML Inc 19
Confusion Matrix
Source Dataset Model Evaluation Prediction
New
Wednesday, September 25, 13
20. BigML Inc 20
Excel Model Export
Source Dataset Model Evaluation Prediction
New
Wednesday, September 25, 13
21. BigML Inc 21
Multi-label Classification
Source Dataset Model Evaluation Prediction
New
age sex Subjects
22 male Math, Arts, Science
21 female Science
22 female Science, Math
bigmler --train Train.csv
--test Test.csv
--multi-label
--label-separator ","
Wednesday, September 25, 13
22. BigML Inc 22
BigML PredictServer
Source Dataset Model Evaluation Prediction
New
Fast predictions
User deployable
Easy
Wednesday, September 25, 13
23. BigML Inc
Your Cloud / VPC
23
BigML PredictServerNew
predictions
predictions
Your App
✓ Fast
✓ Quick in-memory computation
✓ Low latency
✓ Batch predictions
✓ Reliable - Dedicated instance
✓ Scaleable - scales with CPUs
✓ Easy - API is similar to bigml.io
✓ Secure - Use security groups orVPC
Wednesday, September 25, 13
24. BigML Inc 24
Source Dataset Model Evaluation Prediction
New
Text Analysis
Wednesday, September 25, 13
25. BigML Inc 25
Data Types
numeric
1 2 3
1, 2.0, 3, -5.4 categoricaltrue, yes, red, mammal categoricalcategorical
A B C
DATE-TIME2013-09-25 10:02
DATE-TIME
YEAR
MONTH
DAY-OF-MONTH
YYYY-MM-DD
DAY-OF-WEEK
HOUR
MINUTE
YYYY-MM-DD
YYYY-MM-DD
M-T-W-T-F-S-D
HH:MM:SS
HH:MM:SS
2013
September
25
Wednesday
10
02
text
Be not afraid of greatness:
some are born great, some
achieve greatness, and
some have greatness
thrust upon 'em.
text
“great”
“afraid”
“born”
“some”
appears 2 times
appears 1 time
appears 1 time
appears 2 times
New
Wednesday, September 25, 13
26. BigML Inc 26April, 2013BigML Inc, 2013
Be not afraid of greatness:
some are born great, some
achieve greatness, and
some have greatness
thrust upon 'em.
great: appears 4 times
Text AnalysisNew
Wednesday, September 25, 13
27. BigML Inc 27April, 2013
Evergreen or ephemeral?
Build a classifier to categorize
webpages as evergreen or
ephemeral
http://www.kaggle.com/c/stumbleupon
0
ephemeral
relevant for a short period of time
1
evergreen
maintains a timeless quality and
can be recommended to users
long after it is discovered
Wednesday, September 25, 13
29. BigML Inc
Webinar Recap
• BigML is the only cloud-based tool that provides easy-
to-use machine learning on diverse data types
• Fall 2013 Release is a great leap forward:
• Text analysis
• Multi-label classifications
• BigML PredictServer
• Microsoft Excel model export
• And dozens of workflow updates, including:
• In-line field editing
• Confusion matrix / enhanced Evaluations
• Secret links
32
Wednesday, September 25, 13
30. BigML Inc 37
25% WEBINAR25DISCOUNT
Coupon valid thru OCT/13
FEEDBACK
@bigmlcomTWITTER
info@bigml.com
Get Started Today!
RESOURCES
Join us for future
webinars & hangouts
Wednesday, September 25, 13