Michael Bell

Ph.D. Students Blog

Skip to: Content | Sidebar | Footer

Word Clouds (Swiss-Prot and TrEMBL)

21 March, 2011 (16:37) | Annotation Quality, Miscellaneous, Website | By: Michael Bell

Example word cloud from Swiss-Prot Version 9, with common words and numbers removed

During my analysis of Swiss-Prot and TrEMBL datasets I have extracted all the words from each version of each dataset and counted their occurrences. A neat way of looking at this data is to create word clouds. I have done this for all versions of Swiss-Prot and TrEMBL. These can be seen with common words and numbers removed, just numbers removed or nothing removed. I have created a gallery for each of these:

  1. Common words and numbers removed. Click for the Swiss-Prot or TrEMBL gallery.
  2. Numbers removed. Click for the Swiss-Prot or TrEMBL gallery.
  3. Nothing removed. Click for the Swiss-Prot or TrEMBL gallery.

More information about the word clouds can be found on my website.

Comments

Pingback from An Exercise in Irrelevance » Blog Archive » Introducing Michael Bell’s Blog
Time October 18, 2011 at 2:42 pm

[...] good example of this recent blog post discusses the creation of word clouds for all historical versions of Swiss-Prot and TrEMBL and, because everyone loves a word cloud, it [...]

Write a comment