24h-payday

Concept Search in E-Discovery: From Concept to Reality

by Kurt Leafstrand on January 30th, 2011

For years, concept search in electronic discovery has been like concept cars at auto shows: Cool. Slick. The thing that everyone is talking about.

But not ready to move to the assembly line and be put into production.

Like a concept car, concept search has been based on a lot of good ideas and shown a lot of promise. However, it has failed to move beyond a few edge use cases and reach mass adoption in the e-discovery market.  Why is this the case?

It’s not been because it’s an unproven idea or that the basic technology hasn’t been available. In fact, the core algorithm that underlies most existing concept search technologies has actually been around since 1988, when latent semantic analysis (LSA) was first patented by a team from Bell Labs. Over the last 20 years, dozens if not hundreds of companies have sprung up to apply concept search to the broad area of enterprise search and to e-discovery in particular.

To understand why concept search has never taken off, it’s always interesting to look for parallels, and the parallel du jour is social networking. Readers of David Kirkpatrick’s excellent book The Facebook Effect and (perhaps to a lesser, more fictionalized extent) viewers of the movie The Social Network understand that Facebook was far from the first social networking site (remember MySpace? You won’t admit it, but I know you do). But, despite being several years late to the party, Facebook somehow took the core of the social networking idea and presented it to users in a way that really allowed it to “cross the chasm” to the mainstream market.

In introducing Transparent Concept Search, Clearwell plans to help conceptual search cross that same chasm in e-discovery.  In talking to customers over the last couple of years, we have found that there are unmet customer needs with existing concept search products that, once addressed, will really allow its use in e-discovery to flourish – and not just in a way that makes concept search marginally more useful, but, a la Facebook, makes it orders of magnitude more useful.

What are these unmet customer needs?

Ease of use: Historically, concept search has been relatively easy to use in the strictest sense of the word – you type in some terms that represent your concept, and you get a set of search results back, along with some related terms and/or clusters of related documents. Simple, right? The issue is that in most cases that’s not what the user really wants to do. Because concept search is inherently “fuzzy”, users want to be able to refine their concept based on the feedback that they got from their initial search. Concept search, just like keyword search, is an iterative process, and prior-generation technologies have not allowed for that form of iteration. In contrast, Clearwell’s Transparent Concept Search allows concepts to be defined and refined in a way that is intuitive, visual, and (don’t take my word for it, but try it for yourself) fun.

Precision: Traditional concept search increased recall when compared to just keyword search, but it came at the cost of precision. The refinement process facilitated by Clearwell’s Transparent Search addresses this issue by allowing intelligent human input to guide the concept search process. You get the best of both the recall and precision worlds with vastly diminished time and effort.

Defensibility: Even more important than ease of use and precision is defensibility. Defensibility, for those new to the term, isn’t so much about whether the way the algorithms work is known and able to be understood. They are, and aren’t that complicated. Rather, defensibility is about reasonableness: was the concept search a reasonable way of determining which documents are responsive? Without the ability to define your concept in an interactive manner, we believe that the answer has historically been “no”, making concept search nice in theory but unusable in actual legal practice. Transparent Concept Search promises to change that. The end result is a more defensible search process that yields both greater recall and greater precision, enabling users to more quickly analyze case facts, rapidly identify key documents that may have been missed, eliminate irrelevant documents, and prioritize the most relevant documents for review. Clearwell also provides a reporting and auditing feature to document your search, allowing you to improve defensibility by “proving up” what was done.

Low cost: Finally, never underestimate the value of “free” in helping meet the ever-important unmet need of cost predictability and control. Historically, vendors have charged price premiums (often substantial) for concept search. Trying to charge a premium in e-discovery for something that doesn’t fully meet the customer use case and isn’t defensible, and it’s a recipe for low adoption. However, provide a highly useable, effective, and defensible capability as part of the core functionality of today’s leading e-discovery platform, and it starts to look very attractive indeed.

Hopefully you can tell that we’re incredible excited about the promise that this technology holds for the market, and this initial version is really just the beginning. Want to see it for yourself? Check out the video below, visit our web site or, if you are in New York this week, please visit us at LegalTech New York – we would love to see you.

2 Responses to “Concept Search in E-Discovery: From Concept to Reality”

  1. Tweets that mention Concept Search in E-Discovery: From Concept to Reality | e-discovery 2.0 -- Topsy.com Says:

    [...] This post was mentioned on Twitter by InfoGovernance, Mike Fishman. Mike Fishman said: Concept Search in E-Discovery: From Concept to Reality | e …: Like a concept car, concept search has been base… http://bit.ly/hWMJ3A [...]

  2. e-discovery 2.0 » Blog Archive » Plaintiffs Ask Judge Nan R. Nolan to Go Out On a Limb in Kleen Products Predictive Coding Case Says:

    [...] a wide range of technology tools available in the litigator’s tool belt including keyword search, transparent concept search, topic grouping, discussion threading, and predictive coding to name a few. Knowing which of these [...]

Leave a Reply