BlogResearch current status

From PublicWiki
Revision as of 06:29, 25 January 2006 by Jack (talk | contribs)

Jump to: navigation, search

1/24/06 Information diffusion through blogspace seems to be closest to what I would like to do, but they appear to stop short of attempting to predict information flow. They create a model of influence, but stop there. They find influential users by tracking occurences of topics by proper nouns and do so very convincingly.

I also wonder if somehow I should be accounting for general variation in posting frequencies on the bursty code of Kleinberg. Currently there are bursts that are somewhat causes by general increase of posts during the day compared to during the night, Perhaps somehow compressing the time intervals to be the inverse of the total word frequency?


1/23/06

I am currently collecting data-downloading all the current posts from livejournal and saving them in their original form. I have implemented the bursty stream code and have written some code to extract the streams from the post data. Next I plan to automate the process of extracting a word's post frequency data and determine how bursty it is. Hopefully then I will automaticly be able to determine which words are spreading quickly through the blog network.

Blog_Results_1