<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>context analytics&#187; Analysts</title>
	<atom:link href="http://context-analytics.com/tag/analysts/feed/" rel="self" type="application/rss+xml" />
	<link>http://context-analytics.com</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Mon, 30 Aug 2010 17:04:40 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Why analytics companies should stop focusing on &#8220;accuracy&#8221; in automated sentiment analysis</title>
		<link>http://context-analytics.com/2010/04/26/why-analytics-companies-should-stop-focusing-on-accuracy-in-automated-sentiment-analysis/</link>
		<comments>http://context-analytics.com/2010/04/26/why-analytics-companies-should-stop-focusing-on-accuracy-in-automated-sentiment-analysis/#comments</comments>
		<pubDate>Mon, 26 Apr 2010 19:39:14 +0000</pubDate>
		<dc:creator>Seth Duncan</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Analysts]]></category>
		<category><![CDATA[Automated Sentiment]]></category>

		<guid isPermaLink="false">http://context-analytics.com/?p=477</guid>
		<description><![CDATA[Discussions on automated sentiment analysis “accuracy” are starting to border on the bizarre. In the past couple of weeks, I’ve read claims that SAS’s new tool can identify sentiment “better than most humans”. Just a few days later, I read a post this week claiming that ”sentiment analysis [is] best done by humans”.
At the heart of this ongoing debate (and confusion) surrounding automated sentiment analysis is the issue of  “accuracy”– [...]]]></description>
			<content:encoded><![CDATA[<p>Discussions on automated sentiment analysis “accuracy” are starting to border on the bizarre. In the past couple of weeks, I’ve read claims that SAS’s new tool can identify sentiment “better than most humans”. Just a few days later, I read a <a href="http://www.webmetricsguru.com/archives/2010/04/sentiment-analysis-best-done-by-humans/">post</a> this week claiming that ”sentiment analysis [is] best done by humans”.</p>
<p>At the heart of this ongoing debate (and confusion) surrounding automated sentiment analysis is the issue of  “accuracy”– the degree to which software can correctly extract positive, negative, or neutral tone from text. Using “accuracy” as a criterion for useful sentiment analysis demonstrates a fundamental misunderstanding of what sentiment really is and what &#8220;accuracy&#8221; really means. Unfortunately, this misunderstanding has led media researchers and software programmers to search for ”100% sentiment analysis accuracy”, and distracted our industry from what its real focus should be– understanding how the media influences human behavior. </p>
<p>Automated sentiment analysis will never be accurate. Not 1% accurate, 50% accurate, or 100% accurate. To say that an algorithm or statistical model has “accurately” identified a piece of text as positive, negative or neutral requires that sentiment is a real thing in the text that can be correctly identified, like a person&#8217;s name or a product. The problem is that positive and negative don’t really exist on paper or on a computer monitor. The scientists and philosophers who study sentiment all agree that it only exists as property of the animal nervous system. “Positive” and “negative” are neurological states that evolved to helps organisms avoid stuff that can harm them or to promote behavior that’s likely to nourish and help them propagate. Sentiment is absolutely not something that exists “out there” in the world; it only exists in our perceptions of the world.</p>
<p>Because positivity and negativity doesn’t really exist in blog posts, Tweets, Facebook updates, or New York Times editorials, neither human analysts or software will ever be able to “accurately” extract sentiment from them. What analysts and software can do instead is approximate or guess what a reader’s reaction might be to the text. Accurate identification could only be done by measuring actual reader’s emotional reactions to the text—which would be too costly and time-consuming to do.</p>
<p>You might be thinking that  the distinction between sentiment existing in the external world (e.g., text) vs. the internal world (our brains) is purely academic. But it has serious implications for how marketers, communications professionals, media professionals and software programmers tackle the issue of measuring sentiment in large volumes of text.</p>
<p>One relatively minor implication is that when people talk about measuring “accuracy” in automated sentiment analysis, they’re really referring to “reliability”, or agreement between an analysts’ guess about a readers’ reactions to text post and the sentiment decision made by the automated tool. This is more of a pet peeve of mine than anything else (it’s one thing when marketers or software programmers make this mistake, but researchers should know better; a good guide to distinguishing between the reliability and accuracy in measurement can be found <a href="http://www-stat.stanford.edu/~rag/api/shoeshop.pdf">here</a>).</p>
<p>A much more serious implication is that PR and marketing pros need to stop focusing on “accuracy” (i.e., reliability) and start caring about how humans actually evaluate positivity and negativity in the external world. The latter will be a much better predictor of how the media influences behavior and, ultimately, will be most useful to companies who analyze large quantities of mainstream and social media coverage. Given that the process through which the human brain evaluates external things as positive or negative is only beginning to be understood by philosophers and scientists, I’m not optimistic that software programmers and artificial intelligence folks will be cracking that anytime soon.</p>
<p>Since automated sentiment analysis relies on set rules, nearly always tweaked by human analysts, I&#8217;m sure that reliability rates between a tool and a single analyst can reach 90% and beyond (at least within a single set of text on a specific topic). Still, when it comes to approximating what actual human readers are likely to think of an organization or a product in the media, there&#8217;s good reason to believe that human analysts have machines beat. Here are a few things that we do know about how humans evaluate things as good or bad. In each of these cases, human analysts will be better at simulating what an actual reader would do than an automated sentiment analysis tool:</p>
<p> 1) <strong>Different people are going to have different emotional reactions to text.</strong> This point might seem obvious, but it is almost universally overlooked in media measurement conversations. Depending on who you are, you’re probably going to have a different reaction . The human brain is very good at this perspective switching. Starting at a fairly young age, people can simulate the experiences of other people and make good inferences about their emotions, behaviors, etc.. If you’re at a baseball game, for example, and the hitter for the visiting team makes a winning home run, you can effortlessly recognize that that guy must feel pretty good even though it may have ruined your afternoon.</p>
<p>The practical implication for media researchers is that a single piece of text is likely going to have very different affective or emotional meaning depending on who’s perspective you decide to take.  A person could read the  Tweet, “Legit…apply to this contest. Almost no one has applied, so chances are…you’ll win. <a href="http://www.dell.com/w3">www.dell.com/w3</a>” and quickly infer that the writer has a positive attitude towards the contest but that the marketing folks at Dell will probably have a negative reaction. Similarly, the post, “Did you see the next generation iPhone? It was left on a counter by mistake. Hum” might be read negatively by Apple’s PR team but that iPhone owners will probably have a completely neutral reaction to it. A good analyst is able to quickly and effortlessly take on the perspective of the article or post author, a naïve reader, company representative, potential customers, legislators, competing companies, or investors when reading an article or social media post. I have yet to see a piece of software that can approximate this. </p>
<p>2) <strong>Mood impacts evaluative judgments</strong>. Because goodness and badness aren’t properties of the external world that can be detected, humans have to rely on a variety of  information to make sentiment-based judgments. One key piece of information that people tend to use is their own mood. A host of research has consistently shown that people tend to make mood-congruent judgments about objects in their world. If someone is in a good mood, they tend to rate a range of things, from their own life satisfaction to the taste of food, as being better than if they are in a bad mood. In one research study, <a href="http://www.ncbi.nlm.nih.gov/pubmed/621625">Alice Isen and colleagues</a> experimentally induced positive moods in some people, and then asked them to rate the service records of their household appliances (e.g., washers and dryers, coffee makers, etc.). They found that participants in good moods reported much greater satisfaction with the appliances than everyone else (you’re more likely to really like your coffee maker when you’re having a good day). The implication of this for media researchers is that other news, world events, and even  bad weather will likely to affect whether or not a reader interprets a blog post or news story as negative or positive.</p>
<p>3) <strong>Context matters. </strong>One of the most important thing that media analysts can learn from existing knowledge on how humans evaluate sentiment is that context often determines whether or not people perceive otherwise ambiguous things as being either good or bad. In yet another interesting psychological experiment by<a href="http://www2.bc.edu/~russeljm/publications/JPSP1996.pdf"> James Russell and colleagues</a>, participants were shown <a href="http://www.vanderbilt.edu/exploration/resources/facerecog_sixfaces_800.jpg">pictures of people displaying prototypical emotions, such as happiness, surprise, anger, etc.</a> and were told a story about what the person in the picture had just experienced. The researchers found that the story played a huge role in what emotion the participant rated the face as showing. When told that a woman making a prototypically fearful face had just been made to wait for a table at a restaurant for over an hour despite having a reservation, participants tended to rate the face as showing anger rather than fear. Findings like this suggests that situational cues are yet another piece of information that people use to make evaluative judgments about the outside world.</p>
<p>The practical implication to be taken from this is that other stories, blog posts, and Tweets that have been recently read will impact whether or not someone perceives a piece of media as being positive or negative. Other stories in the same magazine, blog posts preceding the one being analyzed, nearby Tweets in a Twitter feed, etc., should all be considered when determining whether or not readers are going to have a positive or negative reaction to a specific piece of text.</p>
<p>I don&#8217;t mean to suggest that, given the complexities of human evaluation, there&#8217;s no point in trying to improve automated sentiment analysis. But,  I don&#8217;t think that the task will be as easy as many media monitoring software providers would like you to beleive. The human brain is incredibly complex, and getting the output of automated sentiment engines to approximate the emotional reactions of real human readers (e.g., customers, voters, investors, etc.) will be a challenging task. Once these challenges are recognized, however, I&#8217;m sure that automated sentiment analysis will eventually come of age as a useful business tool.</p>
<p><script type="text/javascript">// <![CDATA[
var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www.");
document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));
// ]]&gt;</script><br />
<script type="text/javascript">// <![CDATA[
try {
var pageTracker = _gat._getTracker("UA-433187-1");
pageTracker._trackPageview();
} catch(err) {}
// ]]&gt;</script></p>
]]></content:encoded>
			<wfw:commentRss>http://context-analytics.com/2010/04/26/why-analytics-companies-should-stop-focusing-on-accuracy-in-automated-sentiment-analysis/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Companies Are Starting To Realize That Analysts, Not Software, Drive Insights</title>
		<link>http://context-analytics.com/2009/10/06/companies-are-starting-to-realize-that-analysts-not-software-drive-insights/</link>
		<comments>http://context-analytics.com/2009/10/06/companies-are-starting-to-realize-that-analysts-not-software-drive-insights/#comments</comments>
		<pubDate>Tue, 06 Oct 2009 13:28:15 +0000</pubDate>
		<dc:creator>Seth Duncan</dc:creator>
				<category><![CDATA[Mainstream Media]]></category>
		<category><![CDATA[ROI & Modeling]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[Analysts]]></category>
		<category><![CDATA[Digital Marketing]]></category>
		<category><![CDATA[Insights]]></category>

		<guid isPermaLink="false">http://demowordpress.netlingshq.com/contextanalytics/companies-are-starting-to-realize-that-analysts-not-software-drive-insights/</guid>
		<description><![CDATA[Forrester and Google announced the results of their study on trends in enterprise web analytics yesterday and the results suggest that companies are starting to realize that analysts, not software, produce useful insights. The study, which surveyed the web analytics decision makers at 198 companies with annual revenues of at least $500 million, found that [...]]]></description>
			<content:encoded><![CDATA[<p>Forrester and Google announced the results of their <a href="http://analytics.blogspot.com/2009/10/appraising-your-investment-in.html">study</a> on trends in enterprise web analytics yesterday and the results suggest that companies are starting to realize that analysts, not software, produce useful insights. The study, which surveyed the web analytics decision makers at 198 companies with annual revenues of at least $500 million, found that 60% of respondents believe that web analytics analysts are more valuable than software and that 52% prefer free tools because it allows them to invest more in people who actually drive insights into the data.<span id="more-204"></span></p>
<p>This is an especially timely study for the PR industry. I’ve seen quite a few indications that PR professionals are waiting for a magical piece of software that will integrate web analytics, social/online media metrics and produce real-time analysis that will help demonstrate both ROI and help optimize future campaigns. This is one area where PR could learn a few lessons from their digital marketing peers who invest a good deal of budget on analysts and database managers to tie together web analytics, demographic, psychographic and ad placement data flexibly to show which ads and placements are most and least effective. My hope is that PR professionals will become similarly inclined to roll their sleeves up, mine these rich sources of data and uncover the sort of marketing strategy and ROI gems that online advertisers have had access to for years.</p>
<p>Although it’s not explicit in the Forrester and Google report, there’s a also lot of budget to be saved when you rely on analysts to interpret web analytics data. Developing software to seamlessly integrate web analytics and social/online media data is incredibly time consuming and expensive (and, most likely, not incredibly flexible). Any analyst with basic knowledge of SQL or how to merge datasets in SPSS or SAS should be able to accomplish this sort of integration in a few hours. That’s probably going to cost a lot less than developing or purchasing an enterprise software solution.</p>
<p>Software is fantastic for storing large sets of data, writing statistical algorithms for testing hypotheses, and visualizing the results. But, when it comes to deriving actionable conclusions and novel insights based on that data, you’re going to need a good analyst.</p>
<p><script type="text/javascript">// <![CDATA[
var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www.");
document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));
// ]]&gt;</script><br />
<script type="text/javascript">// <![CDATA[
try {
var pageTracker = _gat._getTracker("UA-433187-1");
pageTracker._trackPageview();
} catch(err) {}
// ]]&gt;</script></p>
]]></content:encoded>
			<wfw:commentRss>http://context-analytics.com/2009/10/06/companies-are-starting-to-realize-that-analysts-not-software-drive-insights/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
