On Wednesday the 29th of October I submitted my final report for my Honours project. One years worth of work condensed into a 50 page report. It was a long run, but it was fun.
Next thing on the agenda is some summer research. I’ve received a summer research scholarship in which I perform research for 10 weeks on a topic. My topic that I was given is basically extracting information from Wikipedia and storing it in a logical database known as Cyc. Cyc’s goal (look it up on Wikipedia. Oh the irony) is to create an artificial intelligence that is made up of common sense and can understand human emotion and stuff. I’m still a little hazy on it’s exact definition but I’m sure that I’ll grow to know it well.
And after that is hopefully a PhD in which I can further explore the application of reinforcement learning. I hope to create an agent capable of learning anything from any sources (Cyc, by example, by self-exploration).
I was viewing the results today as I complied them into a LaTeX graph, and it showed that SmartAgent 1.3 was significantly worse than V1.2. Strange, I thought. Hell, it was about as good as 1.1. Then I realised that long ago, I had left a bug in the 1.3 code that got ironed out in 1.4.
This bug has appeared in every version prior to 1.4 so I fixed it and now I have to recollect the results. It shouldn’t be too big a deal, as each agent is pretty fast and the results should be done in a day or so.
It’s been a while. So long that I’ve forgotten what it is that the console and GUI trainers do. I need to recall this for details about them in my final report. Which, if you’re interested is coming along well. Although I don’t have time to dilly-dally as I have little free time at the moment.
Anyway, consoleTrainerJava does this:
Spits out the number of steps per episode and ther total number of steps for all N episodes.
guiTrainerJava does this:
Well, I can’t get it working… Figures. Anyway, from a single, possible misinformed screenshot, it shows the current episode number, as well as the number of steps for this episode and the total steps passed dince stared. It also says the current piece. I’m not sure about this last bit, but I think it also states the number of lines completed. It could be total reward but I’m unsure.