CloudBuzz is an innovative tag based data mining and information research tool build completely on Flex & AIR. CloudBuzz pulls data from popular micro-blogging news sources like Twitter, technical feeds like Adobe MXNA, new sources like Google News and thousands of other information sources. It then filters the data using an in build dictionary of words and plots them on a spatial tag cloud which is ranked based on the number of occurrences of a keyword. The application filters approximately 1500 commonly used words and also uses RegEx pattern matching techniques to deduct in-appropriate words which falls within a pattern matching range.
Just to get an idea of the amount of information being processed, at any given minute there are approximately more than 50,000 words (based on the information sources) which are being processed to generate the cloud data. With this application one can easily have a spatial look at the various terms which are making news and also get an idea of how intense the news is. The data-points are color coded based on word density.