Tag Archives: Hieroglyphics

Stepping Carefully

In this post I will continue with my so-called hieroglyphics project.  This project uses a set of image data that describes handwritten characters.  The dataset is frequently used to evaluate machine-learning algorithms.  I’m using the dataset to explore a variety of modelling techniques within JMP.

In my last post I used a script to incrementally add terms to my model so that I could explore the performance of the model with increasing complexity.  But the order in which I added the terms was based on a heuristic and it wasn’t necessarily optimal.  So in this post I want to explore using stepwise regression.
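As a rough illustration of how a stepwise procedure picks the order of terms for itself, here is a minimal forward-selection sketch in plain Python.  It is not the JMP workflow from the post: it assumes an ordinary least-squares fit, uses the residual sum of squares as the only selection criterion, and the names `solve`, `rss` and `forward_select` and the toy data are invented for the example.  Real stepwise tools normally apply a proper stopping rule as well.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting.

    Assumes the system is non-singular (perfectly collinear columns would fail).
    """
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def rss(rows, y, cols):
    """Residual sum of squares of a least-squares fit on `cols` plus an intercept."""
    design = [[1.0] + [row[c] for c in cols] for row in rows]
    k = len(cols) + 1
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * t for d, t in zip(design, y)) for i in range(k)]
    beta = solve(xtx, xty)  # normal equations: (X'X) beta = X'y
    return sum((t - sum(b * v for b, v in zip(beta, d))) ** 2
               for d, t in zip(design, y))


def forward_select(rows, y, max_terms):
    """Greedily add the column that most reduces the RSS at each step."""
    chosen = []
    remaining = list(range(len(rows[0])))
    while remaining and len(chosen) < max_terms:
        best = min(remaining, key=lambda c: rss(rows, y, chosen + [c]))
        chosen.append(best)
        remaining.remove(best)
    return chosen


# Toy data (hypothetical): y depends strongly on column 2, weakly on column 0.
rows = [[1, 1, 2], [2, 0, 7], [3, 1, 1], [4, 0, 8], [5, 0, 2], [6, 1, 8]]
y = [2 * r[2] + 0.5 * r[0] for r in rows]
print(forward_select(rows, y, 2))
```

The point of the sketch is only that the order of entry falls out of the data at each step, rather than being fixed in advance by a heuristic.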

Logistic Regression pt.1

In a recent post I created a table that contained two classes of data: images that represent either the handwritten digit ‘5’ or the digit ‘6’.  In this post I’ll model the data using logistic regression.  I will also take the opportunity to look at the role of training and test datasets, and to highlight the distinction between testing and validation.
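To make the training, validation and test roles concrete, here is a small sketch of a three-way split in plain Python.  It assumes one common convention: the model is fitted on the training rows, modelling choices are guided by the validation rows, and the test rows are held back for a final check.  The post itself may draw the line differently, and the function name, proportions and seed are illustrative assumptions only.

```python
import random


def three_way_split(rows, train=0.6, validation=0.2, seed=0):
    """Shuffle a copy of `rows` and cut it into training, validation and test parts.

    Whatever is left after the training and validation shares goes to the test set.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is repeatable
    n_train = round(len(shuffled) * train)
    n_val = round(len(shuffled) * validation)
    return (
        shuffled[:n_train],
        shuffled[n_train:n_train + n_val],
        shuffled[n_train + n_val:],
    )


# Hypothetical usage: ten row indices standing in for image records.
train_rows, validation_rows, test_rows = three_way_split(list(range(10)))
print(len(train_rows), len(validation_rows), len(test_rows))
```

Keeping the test rows out of every modelling decision is what lets them give an honest estimate of performance at the end.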

Flippin’ Images

My last post showed a window containing a grid of images.  This was a randomly generated array of images drawn from an extract of the MNIST database, which contains 70,000 samples of handwritten digits (60,000 for training and 10,000 for testing).

However, my pixel data was incorrectly oriented, so the images looked more like hieroglyphs.  Fortunately JMP understands an image as an ‘object’, and allows a variety of transformations to be applied to it, including flipping and rotating.
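Outside JMP, the same flips and rotations can be sketched on a plain grid of pixel values.  The Python below treats an image as a list of rows; the function names are made up for the example, and the tiny 2×2 "image" is a stand-in for a real digit.  The post does not say which transformation fixed the orientation; the last lines show one common case as an assumption, where pixels read in the wrong row/column order give a transposed image, which is the same as a clockwise rotation followed by a horizontal flip.

```python
def flip_horizontal(img):
    """Mirror the image left to right."""
    return [row[::-1] for row in img]


def flip_vertical(img):
    """Mirror the image top to bottom."""
    return [row[:] for row in img[::-1]]


def rotate_clockwise(img):
    """Rotate the image by 90 degrees clockwise."""
    return [list(col) for col in zip(*img[::-1])]


def transpose(img):
    """Swap rows and columns."""
    return [list(col) for col in zip(*img)]


# A 2x2 stand-in image (hypothetical pixel values).
img = [[1, 2],
       [3, 4]]

print(rotate_clockwise(img))                                     # [[3, 1], [4, 2]]
print(flip_horizontal(rotate_clockwise(img)) == transpose(img))  # True
```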
