A lot of interesting techniques involve taking statistical samples, and using those to predict what we'll see in the future. Usually this works pretty well, but when we're dealing with a lot of options or if we have some options that are very rare that approach can go pretty wrong. If we go down the street and note down how many men and women we see, we'll probably be able to use that to predict the chance of the next person we see being male or female pretty well. However, if we were counting all the species of animals we … Read More »

Linear Regression in PHP (part 2)

19 Oct 2011 In: , and

In the last post we had a simple stepping algorithm, and a gradient descent implementation, for fitting a line to a set of points with one variable and one 'outcome'. As I mentioned though, it's fairly straightforward to extend that to multiple variables, and even to curves, rather than just straight lines. Read More »

Linear Regression in PHP

12 Oct 2011 In: , and

I've had a couple of emails recently about the excellent Stanford Machine Learning and AI online classes, so I thought I'd put up the odd post or two on some of the techniques they cover, and what they might look like in PHP. Read More »

Don’t look for meanings, count mentions

7 Oct 2011 In:

It's Ada Lovelace day, giving me a (not often needed) excuse to talk about one of the most interesting people that has worked in information retrieval, Professor Karen Spärck Jones. She worked at the University of Cambridge almost up until her death in 2007, and made significant contributions to natural language processing, machine translation, and particularly to search. Read More »

Understanding XREP in ZeroMQ 2 and 3

3 Oct 2011 In:

The current (and likely for a while) beta version of ZeroMQ, 3, is making some changes that may cause some confusion, particularly around XREP, ROUTER, XREQ and DEALER socket types. I thought I'd briefly breakdown what these types are in the two versions, as much to get it clear in my own head as anything else. Read More »

Pages (7):