Search
The Way of the Software Engineer

Archive for the 'machine learning' Category

Applying JOONE to Real-World Data

Posted by admin on June 6th, 2009

JOONE is a toolset used to build and run neural networks in Java.  To demonstrate its capability, I’ve built a simple supervised network and trained it on a common data set used for other machine learning projects.  By using a common data set, comparisons can be made between the different approaches. The data set was [...]

Recently I’ve been playing with a tool set called JOONE.  The goal of the JOONE project is to produce a fast prototyping environment for Neural Nets and a series of libraries to training these networks.  I have so far ignored the prototyping environment, but I do find the libraries quite useful. Two years ago I [...]

Maximizing the Green-ness

Posted by admin on October 12th, 2007

IBM has announced their new energy rating system for computer hardware.  The rating is driven by hardware monitoring systems for cooling and power consumption and can be read in real time.  This looks like a fantastic metric to minimize. David Anderson PE, IBM green consultant said, “A single mainframe running Linux may be able to [...]

Generative simulation patent?

Posted by admin on August 22nd, 2007

NewScientist is reporting on predictive software systems being patented. Doesn’t anyone bother looking for prior art anymore? A simple Google search would show you that wake-sleep learning and Hierarchical Temporal Models (HTM) have been around for years now. I’ve even seen papers discussing their use in predicting human behavior relating to music sales. I’m working [...]

Multipul Experts

Posted by admin on July 3rd, 2007

There exists a model of using several ‘experts’ or classifiers in order to increase the relevancy of a result. In the fish example from Duda, Hart, Stork, a group of experts are asked to determine if a fish is diseased. 9 of them say it’s not, but one disagrees. How do you ask a computer [...]

Semantics IV: Applications

Posted by admin on June 17th, 2007

Web companies – ad revenue based media companies in particular – have amassed great amounts of data on their users. However, they’re data rich and problem poor. Access and error logs are everywhere, but generally they’re not mined for anything more than simple metrics. I’ve used several log analyzers and web trend trackers like HBX [...]

Semantics III: The Core

Posted by admin on June 11th, 2007

At its heart, a targeting engine must make a decision about what ad to send to a website visitor. This can be done at random, by looking at surrounding content, or by looking at history. Most “site ads” are chosen randomly from a short list that’s manually administered by the site owner. The ad rotation [...]

Semantics II: Advertising

Posted by admin on June 8th, 2007

This is part two of a multi-part series. Part one can be found here. Online advertising is a multi-billion dollar business. Google has become one of the worlds most recognized brands by selling ads on its search result pages and putting context ads on websites. Yahoo and Microsoft are working hard, and spending billions to [...]