<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Software Testing Industry Benchmarks</title>
	<atom:link href="http://mavericktester.com/software-testing-industry-benchmarks/feed" rel="self" type="application/rss+xml" />
	<link>http://mavericktester.com/software-testing-industry-benchmarks</link>
	<description>The startup&#039;s software tester</description>
	<lastBuildDate>Tue, 13 Jul 2010 23:47:40 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: James Bach</title>
		<link>http://mavericktester.com/software-testing-industry-benchmarks#comment-960</link>
		<dc:creator>James Bach</dc:creator>
		<pubDate>Sun, 03 Jan 2010 15:37:36 +0000</pubDate>
		<guid isPermaLink="false">http://mavericktester.com/?p=283#comment-960</guid>
		<description>If you&#039;re going to study a social system (software projects are social systems), then you are engaging in social science. The problem Cem and I have is that the people who push these studies in the computer field so seldom appear to know anything about it. 

Kelvin speaks of &quot;evidence.&quot; He says he wants evidence. No, I think what he wants is an excuse. That&#039;s what bad science does-- it gives reckless, willful people an excuse to believe whatever they like, while asserting with a clear conscious that they were duly diligent.

It seems to me there is a plenty of evidence of a certain sort-- direct experience. Unfortunately, it&#039;s difficult to preserve and recall our direct experiences. When we are asked about them, we remember some of it, misremember other things, misinterpret it, edit it for consumption (so that we don&#039;t look bad or so that other people do), and redact the bits we aren&#039;t permitted to share. And even if we were perfectly honest, our listeners wouldn&#039;t be able to understand or accept all of what we say.

Along comes the survey. What happens next? You might as well answer randomly. That leaves us with fairy tales; mythology. It leaves us with silly ideas like the CMM and TQM and TMAP that hurt us and hinder innovation.

I speak against these because I&#039;m dedicated to real progress. Those of us who care to have made a lot of progress in recent years, not by studying mythical norms, but by learning how to see and think; conferring with each other; and applying the lessons of qualitative research.

I&#039;ve coached lots and lots of testers. I&#039;ve visited a couple hundred companies. The only thing I can say about norms in testing is that it is normal for testers to be confused, untrained, and unable to explain what they do-- and they will try to APPEAR unconfused, competent, and they will make explanations that sound superficially okay, unless you probe a little. That&#039;s normal.

For those interested in helpful science, rather than the pseudo kind, Google the following: ethnomethodology (Garfinkle), symbolic interactionism (Blumer), naturalistic inquiry (Guba), situated action (Suchman), grounded theory (Glaser and Strauss), bounded rationality and heuristics (Simon), the &quot;mangle&quot; of practice (Pickering), sensemaking (Weick), research programs (Lakatos), conjectures and refutations (Popper), paradigms (Kuhn), general systems thinking (Weinberg), radical constructionism (von Glasersfeld), cognitive biases (Kahneman), social life of information (Duguid and Seely Brown).

My reading in these areas, and others, combined with years of seeing people looking for cheap, easy answers and utterly failing, cured me of any urge to look for wisdom in the form of &quot;industry norms.&quot; Instead, let us create a norm called: vigorous self-education.</description>
		<content:encoded><![CDATA[<p>If you&#8217;re going to study a social system (software projects are social systems), then you are engaging in social science. The problem Cem and I have is that the people who push these studies in the computer field so seldom appear to know anything about it. </p>
<p>Kelvin speaks of &#8220;evidence.&#8221; He says he wants evidence. No, I think what he wants is an excuse. That&#8217;s what bad science does&#8211; it gives reckless, willful people an excuse to believe whatever they like, while asserting with a clear conscious that they were duly diligent.</p>
<p>It seems to me there is a plenty of evidence of a certain sort&#8211; direct experience. Unfortunately, it&#8217;s difficult to preserve and recall our direct experiences. When we are asked about them, we remember some of it, misremember other things, misinterpret it, edit it for consumption (so that we don&#8217;t look bad or so that other people do), and redact the bits we aren&#8217;t permitted to share. And even if we were perfectly honest, our listeners wouldn&#8217;t be able to understand or accept all of what we say.</p>
<p>Along comes the survey. What happens next? You might as well answer randomly. That leaves us with fairy tales; mythology. It leaves us with silly ideas like the CMM and TQM and TMAP that hurt us and hinder innovation.</p>
<p>I speak against these because I&#8217;m dedicated to real progress. Those of us who care to have made a lot of progress in recent years, not by studying mythical norms, but by learning how to see and think; conferring with each other; and applying the lessons of qualitative research.</p>
<p>I&#8217;ve coached lots and lots of testers. I&#8217;ve visited a couple hundred companies. The only thing I can say about norms in testing is that it is normal for testers to be confused, untrained, and unable to explain what they do&#8211; and they will try to APPEAR unconfused, competent, and they will make explanations that sound superficially okay, unless you probe a little. That&#8217;s normal.</p>
<p>For those interested in helpful science, rather than the pseudo kind, Google the following: ethnomethodology (Garfinkle), symbolic interactionism (Blumer), naturalistic inquiry (Guba), situated action (Suchman), grounded theory (Glaser and Strauss), bounded rationality and heuristics (Simon), the &#8220;mangle&#8221; of practice (Pickering), sensemaking (Weick), research programs (Lakatos), conjectures and refutations (Popper), paradigms (Kuhn), general systems thinking (Weinberg), radical constructionism (von Glasersfeld), cognitive biases (Kahneman), social life of information (Duguid and Seely Brown).</p>
<p>My reading in these areas, and others, combined with years of seeing people looking for cheap, easy answers and utterly failing, cured me of any urge to look for wisdom in the form of &#8220;industry norms.&#8221; Instead, let us create a norm called: vigorous self-education.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kelvin Ross</title>
		<link>http://mavericktester.com/software-testing-industry-benchmarks#comment-713</link>
		<dc:creator>Kelvin Ross</dc:creator>
		<pubDate>Tue, 08 Sep 2009 04:15:02 +0000</pubDate>
		<guid isPermaLink="false">http://mavericktester.com/?p=283#comment-713</guid>
		<description>Hi Anne-Marie,

Good post to get some debate going.
 
The goal of the industry benchmark is to collate a profile of how organisation manage software testing and quality.  I have found through consulting with many large organisations that they want to know how others are doing it, they want to know where they are different.  We are trying to provide some visibility into this without having to separately visit each of these organisations yourself.  Also it becomes a basis for recommendation that I may have.  As a consultant, I could just say this is my idea, but if I can justify it with some practice-based evidence, then the stakeholders have confidence that they are not going it alone, bleeding edge, or just some new hair-brained idea of a crazy consultant (or new test manager!).

We could argue about metrics, what are useful and what are not, for decades.  I agree metrics and benchmarks will have flaws, but in my view that does not make them irrelevant.

While undertaking academic research we learn that undertaking a study you must consider the &quot;threats to validity&quot; in the data collection and analysis.  Such threats, and I think this is what James and Cem are likely to refer to, exist in even the best scientific studies, even where use of double blind and control group testing.  It becomes very expensive to try to minimise the threats, yet even where various levels of threat exist, does this mean that empirical data is irrelevant and has no value - I don&#039;t think so.  I think that we can debate the data, and the interpretation of the meaning under consideration of the threats to validity.  Analysis of metrics help us to gain a better understanding of the processes that we use, and it should lead us as managers to ask more questions regarding the implications.

As a manager I often need to have evidence to reinforce my proposal or justify changes that I would like to make.  Without evidence it will come down to an argument dependent more on strength of personality than one of insight.

As a business manager there are lots of things that I like to obtain industry benchmarks, that way I can start to understand the health of my business.  I like to consider these items as part of our balanced scorecard, things such as:
- what level of profit is considered healthy for a consulting business
- what are the salary levels that we should pay for key roles within the organisation to stay competitive?
- what proportion of time should I invest in R&amp;D and training?
- how much should I spend on marketing and sales?
- how long does it take to recover payment on invoices?
- and so on

It&#039;s not that I want to do exactly what my peers or competitors are doing, but I want to reflect on where I am different and is there a strategic reason for this.  It also helps me reflect on efficiency of different parts of my business and I can start to see relationships between metrics.  It does not mean that I run my business by the numbers (or the ratios), but I use those numbers to gain insight and to make healthier management decisions.

Translating this into how I would run a testing project or department, I would want to understand similar things, but specific to testing outcomes:
- how much should I pay testing staff?
- what proportion of project budget is considered average?
- where am I finding defects in my development processes?
- how should I resource my team: from permanents, contractors or outsourcers?

By having an industry norm it enables us to see what most other organisations do.  It is not that we have to be average, but it means we then have a way of seeing where we are the norm, and then we can consider or explain why we have chosen a particular strategy.  It helps us understand where we may be different.

So that is the goal, but lets get back to threats.  I think a lot of issues will come out of the study.  How can we segment the data, so that way I might be able to refer to organisations that are similar to mine, in terms of industry, size, etc.?  How do we ensure that data collected is accurate, and we use similar measurement units?  Also the contributors to the survey are perhaps likely to be more sophisticated to quality and testing, those more unaware are less likely to participate.  So does this mean that results are further skewed - perhaps it does.

Some of the high level benchmarks are relatively easy to survey, e.g. perceptions and salary for instance.  However, some of the lower benchmarks, such as process and defect statistics are harder to collect accurately.   I think we need to try though, don&#039;t you?

At the end of the day I think starting to collect empirical data is step towards maturity of our industry.  I think our industry currently lacks empirical metrics, I can rarely find any published metrics from software testing.  I am sure we will get it wrong, studies will be (rightly) criticised, as well as perhaps incorrectly referred to.  However I think debate over such data will mean better understanding by the practitioners.</description>
		<content:encoded><![CDATA[<p>Hi Anne-Marie,</p>
<p>Good post to get some debate going.</p>
<p>The goal of the industry benchmark is to collate a profile of how organisation manage software testing and quality.  I have found through consulting with many large organisations that they want to know how others are doing it, they want to know where they are different.  We are trying to provide some visibility into this without having to separately visit each of these organisations yourself.  Also it becomes a basis for recommendation that I may have.  As a consultant, I could just say this is my idea, but if I can justify it with some practice-based evidence, then the stakeholders have confidence that they are not going it alone, bleeding edge, or just some new hair-brained idea of a crazy consultant (or new test manager!).</p>
<p>We could argue about metrics, what are useful and what are not, for decades.  I agree metrics and benchmarks will have flaws, but in my view that does not make them irrelevant.</p>
<p>While undertaking academic research we learn that undertaking a study you must consider the &#8220;threats to validity&#8221; in the data collection and analysis.  Such threats, and I think this is what James and Cem are likely to refer to, exist in even the best scientific studies, even where use of double blind and control group testing.  It becomes very expensive to try to minimise the threats, yet even where various levels of threat exist, does this mean that empirical data is irrelevant and has no value &#8211; I don&#8217;t think so.  I think that we can debate the data, and the interpretation of the meaning under consideration of the threats to validity.  Analysis of metrics help us to gain a better understanding of the processes that we use, and it should lead us as managers to ask more questions regarding the implications.</p>
<p>As a manager I often need to have evidence to reinforce my proposal or justify changes that I would like to make.  Without evidence it will come down to an argument dependent more on strength of personality than one of insight.</p>
<p>As a business manager there are lots of things that I like to obtain industry benchmarks, that way I can start to understand the health of my business.  I like to consider these items as part of our balanced scorecard, things such as:<br />
- what level of profit is considered healthy for a consulting business<br />
- what are the salary levels that we should pay for key roles within the organisation to stay competitive?<br />
- what proportion of time should I invest in R&amp;D and training?<br />
- how much should I spend on marketing and sales?<br />
- how long does it take to recover payment on invoices?<br />
- and so on</p>
<p>It&#8217;s not that I want to do exactly what my peers or competitors are doing, but I want to reflect on where I am different and is there a strategic reason for this.  It also helps me reflect on efficiency of different parts of my business and I can start to see relationships between metrics.  It does not mean that I run my business by the numbers (or the ratios), but I use those numbers to gain insight and to make healthier management decisions.</p>
<p>Translating this into how I would run a testing project or department, I would want to understand similar things, but specific to testing outcomes:<br />
- how much should I pay testing staff?<br />
- what proportion of project budget is considered average?<br />
- where am I finding defects in my development processes?<br />
- how should I resource my team: from permanents, contractors or outsourcers?</p>
<p>By having an industry norm it enables us to see what most other organisations do.  It is not that we have to be average, but it means we then have a way of seeing where we are the norm, and then we can consider or explain why we have chosen a particular strategy.  It helps us understand where we may be different.</p>
<p>So that is the goal, but lets get back to threats.  I think a lot of issues will come out of the study.  How can we segment the data, so that way I might be able to refer to organisations that are similar to mine, in terms of industry, size, etc.?  How do we ensure that data collected is accurate, and we use similar measurement units?  Also the contributors to the survey are perhaps likely to be more sophisticated to quality and testing, those more unaware are less likely to participate.  So does this mean that results are further skewed &#8211; perhaps it does.</p>
<p>Some of the high level benchmarks are relatively easy to survey, e.g. perceptions and salary for instance.  However, some of the lower benchmarks, such as process and defect statistics are harder to collect accurately.   I think we need to try though, don&#8217;t you?</p>
<p>At the end of the day I think starting to collect empirical data is step towards maturity of our industry.  I think our industry currently lacks empirical metrics, I can rarely find any published metrics from software testing.  I am sure we will get it wrong, studies will be (rightly) criticised, as well as perhaps incorrectly referred to.  However I think debate over such data will mean better understanding by the practitioners.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anne-Marie Charrett</title>
		<link>http://mavericktester.com/software-testing-industry-benchmarks#comment-703</link>
		<dc:creator>Anne-Marie Charrett</dc:creator>
		<pubDate>Wed, 02 Sep 2009 11:05:45 +0000</pubDate>
		<guid isPermaLink="false">http://mavericktester.com/?p=283#comment-703</guid>
		<description>No worries</description>
		<content:encoded><![CDATA[<p>No worries</p>
]]></content:encoded>
	</item>
</channel>
</rss>
