Posts Tagged ‘news’

Data Mining helps win a Pulitzer

Sunday, April 18th, 2010

2010 Pulitzer Prize winners for Journalism: Awarded to the Bristol (VA) Herald Courier for the work of Daniel Gilbert in illuminating the murky mismanagement of natural-gas royalties owed to thousands of land owners in southwest Virginia, spurring remedial action by state lawmakers.

Gilbert spend over 13 months to get to the bottom of the issue, and this investigation also  involved data mining knowledge to catch the irregularities. His work uncovered millions in delinquent payments to landowners thanks to a Virginia law that allowed natural gas companies to set up a complicated royalty system that often never meted out money to its rightful owners. It resulted in making changes in this law which are currently being reviewed.

About Pulitzer Prize:
The Pulitzer Prize is a U.S. award for achievements in newspaper journalism, literature and musical composition. Prizes are awarded yearly in twenty-one categories. In twenty of these, each winner receives a certificate and a US$10,000 cash award.

[EditorandPublisher]

  • Share/Bookmark

Can academia projects help businesses? UoA professor says yes!

Monday, March 29th, 2010

Text mining tools can summarize and look for patterns within large electronic documents. Such tools are still expensive and difficult to use on large scale. But a group of researchers, including one’s at the University of Alberta, are hoping to change that.

A University of Alberta professor is helping to create text analysis tools to deeply examine historical trial accounts from the U.K.’s famous Old Bailey criminal court. While the research project is important to academia, the Edmonton-based researcher said that improving the quality of text mining tools could have benefits for businesses as well.

While academia are developing tools like TAPoR, a textual analysis tool that can summarize a body of text, find collocates, identify important dates, and discover the co-occurrences of two target words, the same could be applied to business records as well. Some of the tools in TAPoR use forms of visualization to help researchers grasp the data even clearer.

[IT Canada]

  • Share/Bookmark

Annual survey shows the importance & high spirits towards data mining

Sunday, March 28th, 2010

As a service to the data mining community, RexerAnalytics conducts an annual online survey (started in 2009). It analyzes some factors like experiences, priorities, views and challenges being faced by data mining industry. The Third Annual Data Miner Survey results were announced after studying the reports of 710 respondents from the data mining community. It was concluded that data miners and their organization’s are highly confident and happy with their services and analytic capabilities giving a feedback of  “above average” or “excellent” performances. Most of them even assured that economy conditions will never be a set back or a weak point for them. According to the survey result, the most commonly used and most satisfying primary data mining tools this year are IBM SPSS Modeler (SPSS Clementine), Statistica, and IBM SPSS Statistics (SPSS Statistics). Open source tool Weka is increasingly used by both academic and for-profit data miners. SAS Enterprise Miner dropped in data miner’s tool rankings this year.

Some highlights:

  • 40-item survey of data miners, conducted on-line in early 2009.
  • 710 participants from 58 countries.
  • Data miners’ most commonly used algorithms are regression, decision trees,
    and cluster analysis.
  • Half of data miners say their results are helping to drive strategic
    decisions and operational processes.
  • 58% say they are adding to the knowledge base in the field.
  • 60% of respondents say the results of their modeling are deployed
    always or most of the time.
  • Most data miners feel that the economy will not negatively impact them.
  • Almost half of industry data miners rate the analytic capabilities of their
    company as above average or excellent.  But 19% feel their company has
    minimal or no analytic capabilities.
  • The top challenges facing data miners are dirty data, explaining data mining
    to others, and difficult access to data.  However, in 2009 fewer data miners
    listed data quality and data access as challenges than in the previous year.
  • IBM SPSS Modeler (SPSS Clementine), Statistica, and IBM SPSS Statistics
    (SPSS Statistics) are identified as the “primary tools” used by the most data
    miners.
  • Open-source tools Weka and R made substantial movement up data
    miner’s tool rankings this year, and are now used by large numbers of
    both academic and for-profit data miners.
  • SAS Enterprise Miner dropped in data miner’s tool rankings this year.
  • Users of IBM SPSS Modeler, Statistica, and Rapid Miner are the most
    satisfied with their software.
  • Fields & Industries:  Data mining is everywhere.  The most sited areas are
    CRM / Marketing, Academic, Financial Services, & IT / Telecom.  And in the
    for-profit sector, the departments data miners most frequently work in are
    Marketing & Sales and Research & Development.

[RexterAnalytics]

  • Share/Bookmark

Ex-Yahoo Executives Launch nPario

Friday, March 19th, 2010

nPario, a data mining company that promises to increase ‘insights into customer behavior, with Bassel Ojjeh(Ex-SVP Strategic Data Solutions at Yahoo) as CEO, and Krishna Uppala (Ex- Senior Director/Architect at Yahoo) is the CTO.

nPario plans to offer data products and services that enable companies to understand and market consumer commercial intent.

About nPario Inc.
With operations in North America and the EMEA region, nPariooffers data products and services that enable organizations to understand and market consumer intent. Led by CEO and co-founder Bassel Ojjeh, nPario’s management team has been on the forefront of harnessing the power of data for organizations such as Microsoft, SAS and Yahoo. For additional information visit www.npario.com

[Read More at nPario]

  • Share/Bookmark