I need to develop a weblog data mining tool.
Given many sequences of URLs, each requested during a single user session (extracted from web application logs), I need to figure out usage patterns and groups (clusters) of website users.
I'm new to data mining and have been searching Google a lot. I found some useful information; for example, the query "Frequency of development of patterns in weblog data" seems to turn up studies of almost exactly this problem.
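To make the task concrete, here is a minimal sketch of what I mean by usage patterns, assuming each session has already been parsed into an ordered list of URL strings. The sample sessions and the `frequent_paths` helper are made up for illustration, and counting contiguous URL runs is only a naive stand-in for a real frequent-pattern algorithm.

```python
from collections import Counter

# Hypothetical sessions: each is the ordered list of URLs one user requested.
sessions = [
    ["/", "/products", "/cart", "/checkout"],
    ["/", "/products", "/cart"],
    ["/", "/about"],
    ["/products", "/cart", "/checkout"],
]

def frequent_paths(sessions, length=2, min_support=2):
    """Return contiguous URL runs of `length` seen in at least `min_support` sessions."""
    counts = Counter()
    for urls in sessions:
        # A set ensures a run repeated inside one session is counted only once.
        runs = {tuple(urls[i:i + length]) for i in range(len(urls) - length + 1)}
        counts.update(runs)
    return {run: n for run, n in counts.items() if n >= min_support}

patterns = frequent_paths(sessions)
for run, n in sorted(patterns.items(), key=lambda item: -item[1]):
    print(" -> ".join(run), n)
```

Something like this is what I would want a tool to do for me, ideally for longer paths and much larger logs.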
So my questions are:
- Are there any Python-based tools that do what I need, or at least something similar?
- Can the Orange toolkit be of any help?
- Would reading the book Programming Collective Intelligence be of any help?
- What should I Google for, what should I read, and which relatively simple algorithms are best to use?
I am very limited in time (about a week at most), so any help would be extremely valuable. I need someone to point me in the right direction and advise me on how to complete the task as quickly as possible.
Thanks in advance!