Friday, April 4, 2014

Making A Biologic At Scale -- A Challenge Uniquely Suited To REAL "Big Data" Analysis


As anyone who has worked around a vaccine shop of any real size knows, there may be no scientific task more daunting than growing a perfectly uniform batch of organisms, from scratch, at a scale of multiple hundreds of thousands of doses.

Imagine trying to simultaneously grow upwards of six hundred thousand purple pansies -- not one of them with a single blemish on any petal. At all. Now imagine having to do that over and over, 52 weeks a year. Oh, and yes -- every last trace of contaminant, biological or substrate-derived, must vanish before these mythical purple pansies are packed and shipped.

But that is the eye of the needle through which vaccines -- and many of the other, newer biologic agents -- must pass. In the early 2000s, Baxter could not keep its anti-hemophilic blood-factor biologic reliably streaming from the end of the line at its clearly state-of-the-art facility in Europe, no matter how hard it tried. Similarly, in the 2010 to 2012 timeframe, Merck and MRL could not reliably keep supplies flowing on certain vaccines (like its Hep B vaccine) -- and likely considered exiting that market altogether.

So. . . unlike a lot of the bluster about uses for big data crunching, this infinitely variable biological problem set is truly an in-the-wild environment where massively parallel, multi-source computing might help tease out the real root causes of production anomalies. Cue the info-media machines -- yes, a feature story follows -- on Merck diagnosing a vaccine problem using real big data approaches. Nice. Do go read it all -- but here is a bit:

. . . .By early 2013, a Merck team was experimenting with a massively scalable distributed relational database. But when Llado and Megaro learned that Merck Research Laboratories (MRL) could provide their team with cloud-based Hadoop compute, they decided to change course.

Built on a Hortonworks Hadoop distribution running on Amazon Web Services, MRL's Merck Data Science Platform turned out to be a better fit for the analysis because Hadoop supports a schema-on-read approach. As a result, data from 16 disparate sources could be used in analysis without having to be transformed with time-consuming and expensive ETL processes to conform to a rigid, predefined relational database schema.

"We took all of our data on one vaccine, whether from the labs or the process historians or the environmental systems, and just dropped it into a data lake," says Llado. . . .


And in the end, they figured it out. Amazing. Know that the next biologic production slowdown (and chaos theory posits that such events are a near certainty, over a large enough n. . .) will likely turn out to be due to the confluence of twenty or thirty wholly new, and very subtle, biological shifts. . . even ambient external plant humidity, for example. So I salute the REAL "big data" hunters and gatherers here.
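
To see why that sort of needle-in-a-haystack hunt is tractable at all once the data is pooled, consider a toy first-pass screen in Python (pandas) -- the file and column names here are entirely hypothetical -- ranking candidate variables by how strongly they track out-of-spec batches:

# Toy root-cause triage (hypothetical columns; real plants log hundreds of
# signals). Correlation is only a first-pass screen, not proof of cause --
# it just tells the biologists where to look first.
import pandas as pd

batches = pd.read_csv("batch_history.csv")   # one row per production batch

# Assumed label column: 1 = failed/out-of-spec batch, 0 = good batch.
failed = batches["out_of_spec"]

# Rank every numeric signal by the strength of its association with failure:
candidates = (batches
              .drop(columns=["batch_id", "out_of_spec"])
              .select_dtypes(include="number"))
ranked = candidates.corrwith(failed).abs().sort_values(ascending=False)

print(ranked.head(10))   # e.g., ambient_humidity may float to the top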

And remember -- unlike the competitive gardener, who only loses "best in show" if her pansies are blemished. . . people may die if even one vial -- of the 600,000 or so doses of a given vaccine -- is biologically inactive, or contains any form of particulate. The idea that we -- in the main -- see a safe, stable supply of any widely available modern biologic is a BIG science. . . miracle. Yes, I think that's the right noun. I plainly do intend to mix cosmology and mysticism here -- with pure inductive scientific reasoning. It is a modern miracle. Thank you, MRL -- and Baxter.

[And. . .what the heck did they put in MY coffee? Wow.]
