Archive for the ‘e-discovery search vendors’ Category

Can AccessData Halt Summation’s Death Spiral in Electronic Discovery?

Wednesday, August 11th, 2010

When I first started working in the electronic discovery industry, I quickly learned two things about Summation: it has a huge installed base of law firm customers, and they all dislike using Summation’s products. It was feedback from these unhappy customers that led companies like Clearwell and kCura/Relativity to enter the review market, and the results are plain to see. While Clearwell and kCura/Relativity are both growing rapidly, Summation has suffered years of declining revenue.

Several people have pointed to poor marketing as the problem, and it’s true most customers are confused. Summation’s products all have different names (iBlaze, Discovery Cracker, CaseVault, CaseVantage), and it is unclear how they relate to one another. But the problem is more fundamental than just marketing. There has been no innovation from Summation for years; its products are difficult to use; and, they don’t integrate with each other. So, naturally, customers switch to more compelling solutions, revenue declines, management cuts costs, talented people leave, service levels deteriorate, more customers defect, and the cycle repeats.

As the management teams at Silicon Graphics, Siebel, or Yahoo! can tell you, once a technology company faces this death spiral, it’s very, very hard to turn things around. But that’s exactly what AccessData must do for its recent acquisition of Summation to work.

On the face of it, you would not expect AccessData to be capable of addressing Summation’s problems. As the #2 player in the forensics market to Guidance Software, it has no experience in legal review. Its customers are enterprises and government agencies, not law firms or litigation support service providers. Its headquarters is in Lindon (Utah), whereas Summation based is in San Francisco. But AccessData has a capable team, and must have some plan in mind. What is it likely to do? My guess is as follows:

  • Claim “end-to-end” in the enterprise market: AccessData will likely bundle the iBlaze review platform with its own forensic collection products (FTK) and claim end-to-end coverage of the EDRM model. The products obviously don’t integrate with one another, or even have the same UI, but some customers may not realize how important that is until after they have purchased. This is the same strategy used by Autonomy, which also puts together disparate products (Aungate, Introspect, etc.) and markets them as an integrated package.
  • Promote CaseVault and CaseVantage in the law firm market: These hosted review platforms are not widely used. AccessData will be hoping that with better marketing and sales execution, it can drive adoption of them by law firms and litigation support service providers. But most providers today seem pretty happy with Clearwell and/or kCura Relativity, so it’s unclear why they would switch away to CaseVault / CaseVantage.
  • Cut costs: On the day the acquisition closed last month, AccessData fired most of Summation’s engineers. That’s understandable, given the shrinking revenue. But it only accelerates the death spiral. With no engineers, it’s impossible to innovate or improve the products.
  • Sunset iBlaze product lines: This sounds radical since, according to Katey Wood at the451 Group, iBlaze accounts for 70% of Summation’s revenue. But AccessData may decide to focus its development efforts on CaseVault and CaseVantage, ceasing all investment in iBlaze. Effectively, this means it would “milk” the law firms using iBlaze, and pitch enterprises a product with no real roadmap for improvement. Given how far iBlaze has fallen behind, there is a strong argument that further investments are probably just throwing good money after bad.

It will take a few months before we can say for sure whether these, or other, changes will make any difference. If the experience of other companies is any guide, they may slow the decline for a while, but not reverse it. After all, there may be some people out there using Silicon Graphics computers to access their Siebel CRM systems or search the web on Yahoo, just like there will be some using Summation’s products for document review. But there are fewer and fewer every day.

Learn More On Litigation Software & Litigation Support Software.

Learn More On Ediscovery Litigation.

2009 TREC Legal Track Sheds Light on Search Efficacy in Electronic Discovery

Tuesday, July 27th, 2010

In one of my previous posts, I had discussed the value and importance of TREC to the legal community. Clearwell Systems has been a TREC participant for the last two years, and believes in working with the rest of the participants in advancing the collective knowledge of electronic discovery-related information retrieval methodologies. TREC’s work has been conducted in the context of annual workshops, and is organized in the form of specific tracks. For legal professionals, the TREC Legal Track is the most relevant, and track organizers have just released the much-awaited overview of the 2009 workshop. I will summarize the key results from the study and its broader implications.

The overview paper is now available and covers the design of the two tasks within the track – the Interactive task and the Batch task. The Interactive Task is very relevant for the legal community, since it is designed specifically for analyzing the task of producing specific records in response to a “discovery request”. As noted in the paper, 15 teams participated, including 10 commercial teams, up from three teams in 2008. The 2009 study was also the first time an email collection (based on Enron emails released by FERC) was used.

The Interactive Task involves a “mock complaint” and seven different topics, with each topic described in the form of a general information request. Several teams participated by choosing one or more topics and submitting responsive documents for each.  These were then assessed using a mathematically sound sampling and estimation methodology, and effectiveness metrics were computed for each team.

The critical summary measure is F1, a combination of precision (estimate of false positives) and recall (estimate of false negatives). Overall, the highest F1 measure achieved across six of the seven topics was very good, as evidenced by values from 0.614 to 0.840. As an example, an F1 measure of 0.840 was achieved with a Recall of 0.778 and Precision of 0.912. This implies that the information request was satisfied with very few false positives (8.8%) and few false negatives (22.2%). Having a high precision implies that your reviewers will be reviewing fewer irrelevant documents, hence reducing your review workload and review costs.  A high recall ensures that very few documents were missed, so your case teams can be confident that all the facts of the case are examined.

It’s always important to look not only at the results, but the costs incurred when achieving said results.  We can break this into the costs that each team incurred, and the costs that assessment and topic authorities incurred. Unfortunately, the study did not track the amount of resources each team expended, so we will have to leave that as a possible improvement for a future study. To get a view of the second cost, a review of the tabulation of team interactions with topic authorities (Figure 3 of the overview paper) is helpful. In this study, the topic authority plays the role of a case expert. The numbers show that for some topics, a highly acceptable F-measure (over 0.75) was achieved even with interactions of 100 minutes, well below the 600 minutes allocated for each team. This would indicate that the teams were able to understand and construct meaningful searches with very reasonable amount of involvement of a case expert.

The other interesting conclusion is that there is value in selecting a corpus containing attachments. The study found that attachments increased the value of responsiveness by measuring the “document to attachment” ratios. For the responsive set, this ratio was a significantly higher value of 4.8 (i.e., responsive document families had, on average, one message and 3.8 attachments), while the entire population had this ratio at 2.2. This suggests that using the Enron corpus that contained attachments was a very good decision.

Of course, the most revealing, controversial finding is with respect to the Assessment and Adjudication phase of the project. As noted in the overview paper’s section 2.3.4, the rate of success of appeals was significant, ranging from 82% to 97%. In other words, the initial sample assessments were reversed in an astonishingly large number of cases. One could argue that the appealed documents were carefully selected, but that argument is weakened by the varying number of appeals by participating teams, and the success rate for even the teams with larger number of submissions. As noted in the paper, the teams that invested greater amounts of resources in the appeals phase benefited proportionately in the levels of improvement of their final precision and recall numbers. I know that constructing appeals can consume a lot of resources since, in addition to the normal information retrieval task, you are required to provide a convincing argument for reversing an initial judgment. This becomes very much a review exercise, not unlike the traditional manual review that the broader legal industry has been struggling with. For example, our own appeals budget was limited, forcing us to sample the appealed documents and select only a few. The outcome of this is that un-appealed documents are all assessed as relevant, which is unsubstantiated by the large number of appeals. In the final analysis, section 2.4.2 illustrates a salient indicator of success – teams that had a positive and useful interaction with the topic authority had the greatest success of initial assessments as well as success in appeals, and the ones that leveraged this for the greatest number of appeals had reported the greatest F-measure.

The 2009 study saw a significant increase in participation from commercial teams. My own personal observation is that unlike academic teams, commercial teams tend to evaluate their participation in TREC projects through the narrow prism of short-term return on investment. While there is value in contributing to the community, I am sure each team is asked to justify the benefits of participation to their management. Some would argue that the full benefit is not realized because of the restrictions placed on dissemination of results within the broader community, especially in the area of marketing the results. I am sure every commercial participant would want to promote their performance, and highlight how their technology and methodology was superior. Given that such direct comparisons are not permitted, the ability to market your results is severely curtailed. The potential for comparative analysis could be a powerful motivator for all participating teams to invest more in the exercise, with the final outcome that the community benefits.

As I noted in my previous post, the legal e-discovery profession needs an independent authority that can challenge vendor claims and provide objective validation of one of the most complex areas of e-discovery – search and information retrieval. TREC has stepped in and served that need very effectively. And, this has been deservedly noticed by the people that matter – Justices of cases involving electronic discovery, expressing their opinions regarding “reasonableness” with respect to cost-shifting, adverse inference, motion to dismiss and other judgments.

A study of such magnitude is bound to have certain flaws, and these are documented in Section 2.5. Leaving aside these shortcomings, the TREC Legal Track effort is immensely useful for both participants and consumers/users of legal technologies and services. The value offered to the community by such studies is well captured in the companion report, titled the Economic Impact Assessment of NIST’s TREC program. As the TREC coordinators are rolling out their new TREC 2010 Legal Track tasks, it is obvious that continued improvements in both the design and execution will make it even more attractive for all participants. Clearwell Systems is committed to the overall goals of TREC and intends to continue their involvement in the TREC 2010 Legal Track projects.

Learn More On Litigation Support Software & Electronic Discovery Litigation

Courts Undecided on How to Handle Email Threads in Electronic Discovery

Monday, June 21st, 2010

Much of the business and personal productivity that comes in the digital world  is from email and its unique abilities. Email allows us to communicate in a way that helps us associate context to our discussions, namely in its ability to be chained into a sequential thread when email users reply to or forward emails they previously received. This accomplishes two important tasks: 1) it allows the person sending the reply or forward to get an understanding of the issues so he/she can craft a meaningful response, and 2) it allows the person receiving the response to understand that response in the context of other on-going discussions. Email programs such as Microsoft Outlook, Eudora, and Gmail help by automatically including content from prior emails, thus producing a long chain of reference.

It is no coincidence that emails thus constitute key evidentiary value in the context of litigation. The inherent value captured in emails is what makes email productions central to pre-trial disclosures and the electronic discovery that precedes it. Courts have long recognized that emails are a business record and subject to discovery. Establishing who said what in the context of a matter in dispute is greatly facilitated by examining the thread of emails recorded in email repositories. With respect to electronic discovery, however, email threading presents several unique challenges. The area of greatest confusion and uncertainty has been the determination of privilege when emails are exchanged with in-house counsel and attorneys and whether such emails are protected by attorney-client privilege or not. A central issue is the composition of privilege logs under these circumstances.

There are several legal opinions on the matter of intermingling privileged and non-privileged communications in an email chain. These opinions have left the matter with little clarity, especially regarding whether the entire email thread is privileged or whether individual emails must be separated out and classified as privileged, with a privilege log listing them. Typically, the most recent email in a thread contains all other emails in that thread. Separating out individual emails (i.e., the contained emails) from the containing email would allow for treatment of just the portions of the email thread that may have privilege. When such separation is permitted, some contained emails may be assessed as privileged while others may not. However, it is entirely possible that the contained email is also present as an independent email under possession of the same custodian or another custodian. When it is present, one could argue that the contained email can just be ignored, and if the corresponding email is responsive, one can ignore the contained email. But rarely does a collection include a complete set of custodians, so the question of whether the privilege log should include the contained item in question still remains. In terms of management of review, and for constructing a privilege log, treating the most recent email and all its contained emails as a single entity is less expensive and cleaner than separating and determining privilege status of each contained email.

Another complicating factor is simply a determination of privilege. Does the mere fact that an attorney was listed as a courtesy CC recipient make the entire email privileged? And, when such emails are then forwarded only to an attorney involved in the case, with a legal strategy discussed in the containing email, is only the new content added to the containing email privileged, or does the privilege determination extend to the other contained emails?  Let’s examine a few opinions for guidance.

With respect to privilege there is a significant body of opinions that would suggest that only communications that explicitly seek legal advice are privileged.

“With respect to internal communications involving in-house counsel, a party “must make a ‘clear showing’ that the ‘speaker’ made the communications for the express purpose of obtaining or providing legal advice”, Chevron Texaco Corp., 241 F. supp 2d) at 1076 (quoting In Re Sealed Case, 737 F.2d 94 (D.C. Cir. 1984)). If the legal and business advice are inextricably intertwined, “the legal advice must predominate over the business advice, and not be merely incidental, for the communications to be protected under attorney client privilege.” Evidently, attempts to include an incidental attorney in a thread would not offer privilege protections. However, the issue is complicated if the most recent containing email is indeed a genuine attempt to seek such guidance. Here again, there are two opinions. In United States v. Chevron Texaco Corp., 241 F. supp. 2d 1065, 1074 n.6 (N.D. Cal. 2002), we note that:

“With respect to each series of emails for which Chevron asserts protection under privilege, Chevron breaks the series into each discrete message. In our view, such a representation of the document is misleading. Each email/communication consists of the text of the sender’s message as well as all of the prior emails attached to it. Therefore, Chevron’s assertion that each separate email stands as an independent communication is inaccurate.”The above would have you prepare a single entity with the most recent containing email and all other quoted emails treated as a single unit. On the other hand, we see the opposite opinion in Universal Service Fund Telephone Billing Practices Litigation, 232 F.R.D. 669, 674 (D. Kan. 2005) where “the court strongly encourages counsel, in the preparation of future privilege logs, to list each email within a strand as a separate entry”. In a related ruling, the court notes: “Obviously, a sufficient (i.e., reasonably detailed) privilege log is vital if litigants and judges are to determine whether documents have been properly withheld from discovery.” As mentioned earlier, this can be much more expensive from a review and production standpoint.

In Chemtech Royalty Assoc., L.P. v. United States, Nos. 05-cv-00944, 06-cv-00258, 07-cv-00405, at (M.D. La. Mar. 30, 2009), we get another perspective: “Asserting privilege for an entire email thread in the privilege log, but only describing the last message in the thread is deficient.”

In Baxter Healthcare Corp. v. Fresenius Med. Care Holding, Inc., No. 07-cv-01359, 2008 BL 229777 at (N.D. Cal. Oct 10, 2008), the defendants are ordered to produce a privilege log that “separately identifies the author, recipient(s), copyee(s), and blind carbon copyee(s) for each logged email communication regardless of whether the communication is part of an email string”. The court directive is: “Each email is a separate communication, for which a privilege may or may not be applicable. Defendants cannot justify aggregating authors and recipients for all emails in a string and then claiming privilege for the aggregated emails.”

Thus, the contained emails must be treated as separate privilege log entries.

In Vioxx Products Liability Litigation, 501 F. Supp. 2d 789, 812 (E.D. La 2007) the court notes:

“Email threads in which attorneys are ultimately involved were usually listed on the privilege log as one message.”  Further, “Simply because technology has made it possible to physically link these separate communications (which in the past would have been separate memoranda) does not justify treating them as one communication and denying party a fair opportunity to evaluate privilege claims raised by the producing party.”

Again, the preference has been to separate out individual contained emails as independent emails with corresponding privilege log.

In C.T.  v.  Liberal School District, Nos. 06-cv-02093, 06-cv-02360, 06-cv-02359, 2007 BL 21826 at (D. Kan. May 24, 2007), the court orders the plaintiff to submit an amended privilege log that listed email in a string as a separate entry.

In Se. Pa. Transport Authority v. Caremark PCS Health, L.P., 254 F.R.D., 253, 264-65 (E.D., Pa 2008) court recommends “analyzing emails in chain separately to rule on defendant’s privilege claims”.

Another significant opinion is found in Muro v. Target Corp., 250 F.R.D. 350 (N.D. Ill. 2007). In addition to at least four motions, an in camera review  was requested for identifying the privilege status of eighty nine documents. Here, the court ruled that FRCP Rule 26(b)(5)(A)  does not require that all contained emails be separated out. However, the court sustains Target’s objection to the Magistrate Judge’s ruling that its privilege log was inadequate for failure to separately itemize each individual email quoted in an email string. In Muro, though, you are allowed to treat an entire email as a single entity only if the non-privileged communications in that chain are otherwise disclosed. Hence, if you wish to treat an email as a single unit, you are required to either disclose the individual contained emails from other custodians, or to list them as Derived Emails (see below).

Another important case is the Rhoads Industries Inc. v. Building Materials Corp. of America et al 2008, WL 5082993 (E.D. Pa Nov. 26, 2008), where the court rendered the opposite opinion:

“Each version of an email string (i.e., a forward or reply of a previous email message) must be considered a separate, unique document, and therefore each message of the string which is privileged must be separately logged in order to claim privilege in that particular document.”

Of course, the context of the Rhoades opinion is the statement: “In the world of electronic communications, a series of email messages, among people employed by the client, but working in different locations, can replace the meeting with an attorney and subsequent letter.” However, this opinion is very debatable.

An entirely different approach is suggested in Apsley v. Boeing Co., No. 05-cv-01368, 2008 BL 12035 at (D. Kan. Jan 22, 2008), with the opinion “Although Boeing listed on its privilege log entire email strings, it redacted only the portion of the string that contained legal communications.” While this seems to be a perfectly reasonable approach, wouldn’t this compromise case strategy since the very fact that certain portions of the non-privileged, unredacted emails were being exchanged with in-house counsel and is therefore part of an attorney communication can be damaging?

Suffice it to say, the courts differ in their opinions on how to handle email threads and their privileged logs. It is in this context that the Clearwell E-Discovery Platform’s treatment of email threads is extremely helpful for preparing your litigation response. In fact, Clearwell has received two patents related to email threading, one for constructing email threads and its ranking and another for determining derived emails from other containing emails and de-duplication in the context of original emails. Clearwell has advanced email meta-data and content analytics to piece together all emails of a thread. Furthermore, its Derived Email feature separates out contained emails as complete emails, which are then de-duplicated against other emails that are not derived from a contained email. In situations where such a duplicate is not identified, the derived email is maintained in a special state. Also, the containing email’s thread is separated out in such a way that each individual email’s privilege status can be determined. One can apply either a single- or multiple-record policy satisfying whatever the prevailing opinion is from the bench. Also, Clearwell’s redaction capabilities and its ability to produce the same set of documents for multiple parties allow the case team to provide a quick turnaround if there is a motion to produce either a privilege log or the non-privileged snippets of emails. Such technology can be a lifesaver when it comes to meeting electronic discovery obligations.

Go With the (Work)flow in Electronic Discovery

Thursday, June 10th, 2010

Recently, I attended a conference in Washington DC with a large number of government agencies, including (I must confess) many Clearwell customers like the Department of Health and Human Services, the Department of Homeland Security, and the Veterans Administration. It will probably come as no surprise that, during our conversations, it became abundantly clear that they had substantial electronic discovery technology needs. Many were still reviewing PST files manually in Outlook; others were TIFFing millions of pages of documents prior to directly loading into a traditional review application for eyes-on review. That’s right, nary a trace of early case assessment, transparent search, or culling to be found.

Sadly, no news there. What was fascinating for us was the reaction to the latest release of the Clearwell E-Discovery Platform, Version 5.5. Version 5.5 contains significant new functionality, including dramatically increased performance and scalability along with a number of substantial processing, analysis, review, and production enhancements. But, in addition to these features, we have rolled out a set of e-discovery best practices templates designed to make it vastly easier for organizations to implement a formal e-discovery methodology that builds on the integrated nature of our platform. And it was the prospect of such a methodology, even more than the technology, that people were buzzing about at the summit.

Why? With all of the activity going on in the e-discovery space around product and technology innovation, there was some strong feedback that process and methodology may have gotten lost in the shuffle. And, if you think about it, it’s process and methodology that are likely to be most carefully assessed when the courts are considering the reasonableness (or lack thereof) of e-discovery for a case.

The importance of putting process and methodology front and center (along with a commitment to making the necessary organizational changes to make it happen) is not exactly a new concept. Ralph Losey has been talking about it for years over on his groundbreaking and irreverent e-discovery team blog, and it’s a frequent topic of keynote speakers on the e-discovery lecture circuit. However, like eating your vegetables or exercising, putting in place the right e-discovery process in an organization is something that people realize the benefit of, but still ignore.

This cannot continue, as the stakes are escalating. Take the recent case of Mt. Hawley Ins. Co. v. Felman Prod., Inc. Dean will dive into this case in much greater detail in an upcoming post, but it is very relevant to the methodology versus technology discussion in that it highlights how a methodology problem can cause a fateful technology problem to be overlooked. In this case, a lack of sufficient quality control processes caused the plaintiff to inadvertently produce a number of privileged emails. The court found the inadvertent production was not “solely attributable” to a problem with a Concordance index, and that the plaintiff “failed to perform critical quality control sampling” to determine whether the production was appropriate. Privilege was waived.

What’s the solution? We believe that we’re on to something with Clearwell 5.5, in that we can, uniquely among e-discovery products, marry together methodology and technology in a single platform that allows for the entire e-discovery process to be documented and defended, end-to-end. We have particularly focused on the most critical part of the process which seems to come up over and over again in sanction and privilege waiver decisions, which is the way that an organization moves from an initial pool of documents to a set of defensibly-culled, potentially responsive documents, on through to tagging and production. Our unique workflow capabilities allow the entire process to be documented and instantly recalled with the click of a mouse, letting you see each decision that was made during the course of the case in a step-by-step fashion, and then to structure additional quality control audits on top of those decisions to ensure that every “i” is dotted and every “t” crossed.

It’s a good thing for everyone involved in litigation that e-discovery technology is maturing rapidly to the point where it can start to help solve these sorts of process problems rather than being the cause of them (as was the unfortunately case in Mt. Hawley). This is a major focus for us at Clearwell and you’ll see a lot more exciting news from us on this front over the next few months, so stay tuned!

What’s Next For Kroll Ontrack?

Tuesday, June 8th, 2010

Yesterday, Marsh & McLennan (M&M) announced the sale of Kroll, its investigative services division which last year generated $678 million in revenue. Kroll is being acquired by Altegrity, another investigative services company which is owned by Providence Equity. The acquisition price is $1.13 billion, below the $1.3 billion M&M was rumored to be asking, and the deal is financed by Apollo Investment Services and Goldman Sachs.

There are many aspects to this transaction, but I want to focus on just one: what does this mean for Kroll Ontrack, Kroll’s largest division with $250 million in revenue and a staggering 1,500 employees, making it by far the world’s largest e-discovery service provider?

To answer this question, I will first outline the strategic challenge facing Kroll Ontrack, before outlining two alternative strategies its new owners may adopt for addressing it.

Strategic Challenge: Kroll Ontrack Is The “Yahoo! Of E-Discovery”

Just as Yahoo was an internet pioneer in the 1990s, Kroll Ontrack was the pioneer of electronic discovery services. Like all pioneers, as the first to market, Kroll had to build everything itself. So Kroll Ontrack invested not only in recruiting and training its staff of skilled consultants, the company also developed its own suite of e-discovery tools and software. It offered this integrated package of services and software to the market and, justifiably, charged a price premium.

But as the industry matured, it disaggregated with more savvy customers and new companies focused on specific parts of the value chain. Customers became better educated and more confident making decisions, diminishing the value of Kroll’s “we-are-the-safe-choice” value proposition. These customers today have many more options for e-discovery than was the case in years gone by, primarily because of a generation of e-discovery software companies, such as Clearwell, Guidance, Exterro, and kCura/Relativity, which offer capabilities like collection, ECA (Early Case Assessment), litigation hold management, and linear review. These have been widely adopted by Kroll Ontrack’s competitors, negating Kroll’s technological advantage. Even worse, because Kroll Ontrack’s competitors do not need to invest in R&D, they have a substantially lower cost structure. As a result, they have undercut Kroll Ontrack on price, which has halted its growth and squeezed its margins.

In a directly analogous way, Yahoo! has seen its broad internet service to consumers eroded by a host of more focused competitors such as Google, Facebook, and Skype. Consumers today are much more familiar with the internet, and feel comfortable making separate choices for search, social networking, and messaging, without the need for an umbrella brand. That has left Yahoo! without a reason for being: even today, its CEO struggles to answer the fundamental question “what is Yahoo!?”

Solution: Sell It Or Fix It

As Kroll Ontrack’s new owner, Altegrity has a simple choice. It could sell Kroll Ontrack, making the strategic challenge someone else’s problem; or Altegrity could fix it, by adopting a fundamentally different strategy.

Let’s consider each in turn:

Sell It: Most sensible people would find it funny to think about selling something right after you bought it. But in this case, it could make a lot of sense. Altegrity is a leading provider of investigative services, not e-discovery, making the “non-Ontrack” part of Kroll’s business a much better fit. So why not sell Kroll Ontrack, pay down debt, and focus on the services business which it understands? This would be especially attractive if, as Vivian Tero at IDC suggests, there are willing buyers such as ECM or storage software companies which like Kroll Ontrack but do not want the services business.

Fix It: Mike Cherkasky, Altegrity’s CEO, is a former head of Kroll, and so is perhaps uniquely well placed to bring about a change in direction. To do so, he must decide what Kroll Ontrack wants to be. If its goal is to be the leading e-discovery service provider, then it should kill its internal software development efforts and focus on providing customers the absolute best service using industry leading tools. If it wants to be an e-discovery software company, which would be a much harder transition, then it needs to exit the services business and make its technology available every litigation support company.

Either way, it will take time and a lot of painful decisions for Kroll Ontrack to recover its momentum. But if any encouragement is needed, the Altegrity and the Kroll Ontrack teams need only look at what’s happening to Fios, another of the industry’s early pioneers. So far, Fios has refused to decide what it wants to be, abandoning its internal review platform for Relativity but keeping its proprietary processing software. The result? It’s had three different CEOs in the past 12 months, and competitors continue to steal market share.

Learn More On Litigation Software & Litigation Support Software.

Kroll Ontrack and Iron Mountain Stratify Demonstrate That “Free” Is Usually NOT The Cheapest Solution For Electronic Discovery

Tuesday, June 1st, 2010

Every car dealer knows he should focus customers on the monthly payment, not the total cost of the car. Every credit card solicitation (or sub-prime mortgage, for that matter) starts with the offer of 0% interest, not the actual interest rate or fees the customer will pay after the first 6 months. The reason is simple: once you lease the car or put a balance on the credit card, it’s very hard to switch away when – as often happens – you find yourself paying much more than you should later on.

I was reminded of these examples when reading about Kroll Ontrack’s offer of “free ECA” and Stratify’s recent press release announcing “free early stage filtering” for electronic discovery. Taking each in turn:

Kroll Ontrack Advanceview

Based on feedback from several customers in Washington DC, New York, and the Mid-West, Kroll Ontrack often provides Advanceview at no charge. That means customers can get “custodian de-duplication” and “1 keyword and date filter pass” for free, although Kroll still charges $200-250/hour for doing the work. The resulting data set is then processed and loaded into its review platform for $1,500-$1,800 per gigabyte.

Is this a good deal? For the vast majority of customers, the answer is “no” for three reasons.

First, customers typically end up paying more than they would using alternative products. For example, in the chart below, we compare the cost of using Kroll Ontrack to that of Clearwell for a 100 gigabyte project. In both cases, we assume customers are doing de-duplication, filtering, keyword searching, first pass review, and load file creation. As with any comparison of this sort, you have to make some simplifying assumptions. For example, we excluded data hosting fees and professional services fees from the analysis.

Whether customers are better off with Kroll depends entirely on how much data is culled out for free before customers incur the high, back-end charges. Given that all Kroll is doing for free is custodian de-duplication and running one set of keywords and date filters, the typical cull rate is likely be anywhere from 20% to 50% — nowhere near the 80% cull rate required for Kroll to be more cost effective than Clearwell.

The second reason why this is not a good deal is that it gives customers no certainty about costs. Culling rates from de-duplication and blind keyword searches are unpredictable and vary widely, meaning that some projects will cost more than expected while others will cost less. But every project has budget that’s determined up front and, as any litigation support manager will tell you, you get much less credit for being under budget than you get pain for going over budget. That’s why cost certainty is one of the leading requests from anyone involved in electronic discovery.

Finally, excluding data based on a single round of keyword searches and date filters is not in line with The Sedona Conference best practices. Rather, Sedona recommends that customers iterate their keywords and culling strategies to hone them appropriately.

Iron Mountain Stratify OnPoint

It is not yet possible to do the same detailed analysis on Stratify’s OnPoint which offers “free early stage filtering”, because it’s impossible to tell exactly what that means. In its artfully-worded press release and data sheet, Stratify promises to provide “free processing and loading of unlimited data for early stage filtering”. Does that include de-duplication? Does that include any keyword searching? My guess is “no”, in which case all they are really doing for free is offering to load data into their review platform so that they can then charge you – not a very compelling offer. But if anyone does know the answer to these questions, or if Stratify would like to clarify exactly what’s being offered for free, then please let me know and I’ll post an update.

Once data is in Stratify’s system, it charges a “one-time fee starting at $500 per gigabyte” for “reviewable data”. But it does not say if that’s the only fee. What about monthly hosting charges? Fees for additional reviewers? Again, it’s not yet clear what the downstream cost of review really is using Stratify, so it’s impossible to know whether this is a good deal.

If there’s one lesson from all of this, it’s “buyer beware”. Just as when you buy a car, sign up for a credit card, or click on that offer to get more corn on Farmville, you need to look beyond the “free offer” and understand what it’s really going to cost you.

E-Discovery and the Cloud: The Duty to Preserve Electronically Stored Information (ESI)

Friday, May 28th, 2010

One of the new buzz words of the last few years in computing has been Cloud Computing. After the initial hype, and the subsequent shakeout of its potential, everyone is beginning to recognize that it represents a paradigm shift in how we purchase, deploy, and utilize computing resources. The general impetus for the cloud has been its potential to reduce capital costs, offer flexibility in purchasing computing resources, and reduce operational costs in maintaining hardware resources.

A lot of what the cloud offers is achievable using existing technologies, but repurposed in new and innovative ways. Several forms of the cloud, with specific benefits to customers, are being packaged and promoted. The offerings are delivered as cloud services, such as Platform as a Service (PaaS), Infrastructure as a Service (IaaS) and Software as a Service (SaaS). Without getting into specifics, each service offering comes with a set of service agreements between the purchaser and provider of the cloud services.

As with any new initiative, there are new challenges to contend with including security and compliance with corporate policies and industry regulations.  Although these issues are substantial, for this article, let us consider the legal implications as it relates to electronic discovery. We all know that sooner or later, every organization faces litigation, and increasingly, fair number of them involves e-discovery. Traditionally, in house legal and IT teams have had an understanding of how to respond to legal requests and have focused on litigation readiness. But, how do these translate to the new cloud computing paradigm? I’ll examine some of the challenges in a series of posts on e-discovery and the cloud. For starters, let’s analyze the challenges and considerations inherent with the duty to preserve electronically stored information (ESI).

Duty to Preserve ESI

Before we get to the mechanics of electronic discovery and actual preparation for Rule 26(f) conference, the duty to preserve arises. The duty to preserve may be triggered when a legal proceeding is “reasonably anticipated” and increases in importance on receipt of pre-litigation correspondence or a similar trigger event. Traditionally, such duty to preserve is reflected by placing litigation holds. It is often the case that litigation holds are placed on at least a portion of the ESI well ahead of an actual triggering event. See Adams v. Dell as perhaps an extreme example. In fact, some organizations invest in litigation support software technologies for classifying data and placing holds on the most reasonable subset.

How does such a litigation hold translate into the cloud? As a customer of a cloud, one should craft service agreements to dedicate certain cloud-resident data, in the form of folders or other broad categories, to be preserved. If the cloud provider has deployed technology to ensure that no party within the customer’s user community can delete the preserved data, it is well and good. However, placing such restrictive access impedes normal running of the business, and becomes impractical. Essentially, data in the cloud that is available for normal course of business is in the hands of user-custodians. If they then delete the data either deliberately, or inadvertently, or through normal business functions, that data deletion is subject to spoliation claims. Even though the “safe harbor” from spoliation sanctions of Rule 37(f) applies when information is lost due to the “routine, good faith” operation of electronic information systems, when preservation order is in place, shelter under 37(f) is not possible. Thus, the actual implementation of litigation hold comes under scrutiny. Because of this, many implementations adopt preservation using a “copy and preserve” model. However, this model is at odds with live business data that is constantly evolving. Even if the latest point-in-time snapshot technology at the physical volume is employed, the result is inadequate – you end up preserving massive volumes of data in the cloud, unrelated to actual logical messages or files that need to be preserved. What is needed is some smartness in the form of an application in the cloud itself that can translate a litigation hold request into specific ESI in the cloud. Who owns and manages this application and what the service levels are for this application is a significant issue.

Now, the view from the cloud provider’s perspective is very different. In light of the flexible data management architectures available, there is a great temptation to share both data with a litigation hold and data without a litigation hold on the same physical infrastructure. As a result, the cloud provider   preserves all data from every customer that is resident on that infrastructure – a very conservative approach. As a consequence, this would preserve another customer’s ESI accidentally and that data is now discoverable, in the context of a different litigation, despite the second customer’s active management of the data. Preserving a set of live, constantly changing data in the context of a single enterprise is technically difficult; doing so across multiple customers, sharing the data infrastructure is exponentially harder.

Another related issue with preservation is the need for the ability to release preservation holds. Typically, when the litigation response team determines that the legal hold is not necessary, the hold is released. In the “copy and preserve” model of litigation hold, one has to verify that the released ESI does not overlap with other litigation holds and is marked for destruction. One of the benefits of the cloud is the flexibility in storing bits and pieces of data wherever data capacity is available. Applying the release can again be tricky for both cloud customer and the cloud provider.

Given these additional complexities of evidence in the cloud and the fact that the duty to preserve may arise well before the trigger event of litigation, the costs associated with the duty to preserve can add up very quickly. It’s essential to understand three critical items related to the duty to preserve in the cloud: 1) what the cloud provider would charge for ongoing preservation, 2) whether agreements with the cloud provider cover the legal issues raised by the duty to preserve and 3) what the cloud provider offers in terms of a flexible workflow for applying and releasing legal holds.

Learn More On Litigation Software & Electronic Discovery Litigation

Clearwell And Apple Partner to Bring E-Discovery To The iPad

Wednesday, March 31st, 2010

So far, most coverage of the iPad’s release on April 3 has focused on its likely impact on the media industry. Largely unnoticed, but just as profound, is the effect it will have on electronic discovery. Because the iPad is the first truly user-friendly device that enables people to do e-discovery any time, any place, and any way they can imagine,  it promises a revolution.

We saw this early on at Clearwell and, while the iPad was still in development,  partnered closely with Apple to ensure the device meets the needs of the e-discovery community. This required tight integration between the iPad’s unique capabilities and those of our own e-discovery platform, and has taken several months of close collaboration. The result is nothing short of spectacular, as you can see for yourself right here.

Read the official announcement from Apple and Clearwell and pre-register for an E-Discovery iPad now. It makes PC-based e-discovery feel like sifting through stacks of paper in a warehouse.

Learn More On Ediscovery Litigation.

Can an In-House E-Discovery Solution Be Built in a Day?

Monday, March 8th, 2010

After more than ten years of IT experience and over a year of experience as an attorney working exclusively with e-discovery, I am delighted to join the E-Discovery 2.0 team.  I am a member of the South Carolina Bar Association and the American Bar Association.  In this and future posts, I will try to bring a practical perspective or view from the trenches to this blog – a look at how to deal with some of the day-to-day problems facing e-discovery practitioners today.  I will begin with a discussion about how to approach the decision to move e-discovery in-house, and although the desire to build a solution “in a day” is tempting (and sometimes precipitated by necessity), a solution that will stand the test of time and provide the greatest ROI requires a bit more planning and care.

E-Discovery can sometimes be thought of as an ailment that requires a quick remedy in the form of software or services.  We continue to be reminded, however, that e-discovery is much more than a fleeting malady; it is an ongoing business problem that must be treated with the same diligence and meticulous execution as regulatory compliance or data security.

So where should the prudent practitioner begin?

Every good IT project manager I have ever worked with always had the same mantra when it came to solving a problem with technology – make sure the business problem has been well defined and establish detailed requirements before venturing into the marketplace.  So, why are so many companies sending out form RFPs containing canned text expecting to find a miracle “end-to-end” e-discovery solution in a relatively short period of time?  The answer, I believe, lies both in the abundance and availability of generic information about e-discovery and the fact that most companies looking to bring e-discovery in-house are already feeling the pain of rising costs and demands on existing staff.  They are, in short, trying to conquer their e-discovery problem in a day.  To truly conquer the problem, it should be attacked from the areas causing the greatest pain and expense first, and those areas should be thoroughly examined using proven project management techniques.

If e-discovery is indeed a significant business process, then companies must address that problem using the same proven methods that they have been using for years to solve other business problems.  For example, every company today, believe it or not, has an e-discovery solution in place.  If the company was sued tomorrow, and there was a significant e-discovery component to the matter, the company would likely react in a certain way based on a number of factors – hire outside consultants, work with a litigation support provider, rely on their outside counsel to coordinate e-discovery, etc.  So why not predict that reaction, analyze it, and determine where the greatest expense and pain lies in that process?  From that data, the company can decide which portions of the e-discovery workflow, if any, should be brought in-house, and it can seek out best-of-breed solutions rather than settling on the first end-to-end vendor that comes knocking.  The next step is to rely on those time-honored project management edicts – define the business problem and establish concrete requirements.  Then the company will be armed with the most powerful weapon in the marketplace – the power to distinguish.

The burning question, then, is how does the company decide which portions of the e-discovery workflow to bring in-house?  The answer is relatively simple: you follow the money (right out of the front door in many cases).  Where is the company spending most of its e-discovery budget, and are those portions of the workflow good candidates to bring in-house?  Typically, processing data and review are the most expensive phases of any e-discovery project.  The logic here is simple: if you send 100GB of ESI to outside counsel to review, it will be more expensive and time-consuming than sending only 20GB.  Thus, processing, analysis, and first-pass review are great candidates to be brought in-house from an ROI perspective, and bringing these phases in-house could facilitate a form of early case assessment given the right solution.

Now, suppose a company decides to bring processing, analysis, and first-pass review in-house, also leveraging their chosen technology solution for early case assessment.  Now what?  The process can simply be repeated.  Given the solution implemented, what happens if we get sued tomorrow?  What other portions of the e-discovery workflow will need to be outsourced and how will we do that?  What will that cost?  Is there a better way?  The company can continue this process until it determines that either all portions of its e-discovery workflow have been successfully brought in house or the ROI of bringing additional portions of the workflow in house does not justify additional projects at that time.  This analysis should then be repeated on a regular basis to ensure the current solution is still meeting the needs of the organization and that market or industry shifts have not created additional opportunities for cost savings.

Although an effective and defensible in-house e-discovery solution likely cannot be built in a day, a carefully crafted plan of attack and a thorough understanding of the organization’s particular needs can strategically position it for long term success.

Frcp Electronic Discovery.

Why Did Iron Mountain Digital (Stratify) Acquire Mimosa, And What Does It Mean For The Archiving / E-Discovery Industries?

Wednesday, February 24th, 2010

Yesterday, I explained what I think Iron Mountain’s acquisition of Mimosa says about valuations in the archiving / e-discovery industry. Today, I will address the other questions that people commonly ask about the deal – why did Iron Mountain (Stratify) do it, and what does it mean for the electronic discovery industry?

In their letter to customers announcing the deal, Ramana Venkata (President of Iron Mountain Digital) and TM Ravi (CEO of Mimosa) point to two main benefits from combining the companies. On the archiving side, Iron Mountain can now offer Mimosa as an on-premise solution in addition to its existing hosted service. If it can integrate the two, then it can offer “location-independent” archiving which “will help you transparently and seamlessly move data between the on-premises data center and the cloud.” One additional benefit to Iron Mountain, which is not mentioned in the letter, is that it could even leverage Mimosa’s technology for its hosted offering, and replace Mimecast who it currently pays to provide this service.

On the e-discovery front, Iron Mountain now has a suite of 2 products and 1 service: Mimosa NearPoint for collection and preservation; the Stratify eVantage appliance for ECA (Early Case Assessment); and, Stratify Legal Discovery Services for review and production. This makes Iron Mountain a competitor to Autonomy, Clearwell, EMC/Kazeon, and everyone else listed in Gartner’s recent MarketScope covering e-discovery software companies. I’m sure the hope is that there’s synergy between the different products so that, for example, Mimosa’s experience in on-premise software will help Iron Mountain drive adoption of its new Stratify eVantage appliance behind the firewall.

Will the combination work? As Barry Murphy (a former Mimosa employee) points out in his excellent post on this topic, a lot depends on execution. But there are at least 2 reasons to be doubtful. First, the competition is far ahead, and will be hard to catch. As Barry, points out: “Iron Mountain will have a tough road ahead to compete with the likes of Autonomy, which bought successful archiving company Zantaz and has now had almost two years of development time for its hybrid on-premise/SaaS archiving offering.” The same is true on the e-discovery side, where companies like Clearwell have hundreds of corporate customers for on-premise ECA and review.

The second reason to doubt why the combined company will be any more successful than either were before the acquisition is that Mimosa and Iron Mountain Digital serve very different markets. Most of Mimosa’s customers are small to medium sized companies; most of Iron Mountain Digital (ie., Stratify)’s revenue comes from law firms. So it’s not obvious that by combining them you create a company well-suited to serving large corporations, which is the sweet spot for e-discovery and archiving.

It will be interesting to watch events unfold.

Learn More On Litigation Support Software & Electronic Discovery Litigation