Blog Word Frequency

July 6th, 2006

I exported all of my blog entries to a single HTML file (676KB of HTML) and decided to run it through TAPoR text analysis to find the words that I most commonly use on the blog using the stop word list to remove the junk.

Top Words

Note I’ve manually removed some of the junk such as numbers from this list.

Words	Counts

Firefox	423It's	378Google	360Use	312New	254Like	230Web	216Blog	202Using	200People	197Time	187Really	176Page	166Quite	158Just	158Version	157Search	156Windows	145I'm	145Used	143Internet	142I've	141Php	139Make	139Way	137Work	133Bit	133Don't	130Probably	126Forum	126Messenger	125Lot	119Users	119Good	116Image	114User	113Browser	113Javascript	113Reflection	112Site	109Code	107Nice	107Explorer	106Website	103Want	101Open	99Mozilla	99Html	97Opera	96Information	93Pretty	92Feature	91File	89Text	88Features	88Does	88Look	87Content	86Need	85Think	85Example	83Script	81Great	80

From this list, it looks like I love waffling on about Firefox and Google, Blogs, People, Time, Windows, the Internet, PHP, forums and messenger. Seems about right.

 
  1. Blog Spam
  2. New Blog Theme
  3. Free version of Microsoft Word
  4. Blog Keywords
  5. Added Categories to Geneone Blog

Trackback URI | Comments RSS

Leave a Reply