

MLOps.community
Demetrios
Relaxed conversations about getting AI into production, whatever shape it may come in (agentic, traditional ML, LLMs, vibes, etc.)

Feb 21, 2022 • 51min
Trustworthy Data for Machine Learning // Chad Sanderson // MLOps Meetup #93
MLOps Community Meetup #93! Two weeks ago, we talked to Chad Sanderson about Trustworthy Data for Machine Learning.
Join the Community: https://go.mlops.community/YTJoinIn
Get the newsletter: https://go.mlops.community/YTNewsletter
// Abstract
The most common challenge for ML teams operating at scale is data quality. In this talk, Chad discusses how Convoy invested in a large-scale data quality effort to treat data as an API and provide a data change management surface to enable trustworthy machine learning.
// Bio
Chad Sanderson is the Product Lead for Convoy's Data Platform team, which includes the data warehouse, streaming, BI & visualization, experimentation, machine learning, and data discovery. Chad has built everything from feature stores, experimentation platforms, metrics layers, streaming platforms, and analytics tools to data discovery systems and workflow development platforms. He has implemented open source and SaaS products (early and late-stage) and has built cutting-edge technology from the ground up. Chad loves the data space, and if you're interested in chatting about it with him, don't hesitate to reach out.
--------------- ✌️Connect With Us ✌️ -------------
Join our Slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, Feature Store, Machine Learning Monitoring, and Blogs: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Chad on LinkedIn: https://www.linkedin.com/in/chad-sanderson/
Timestamps:
[00:00] Introduction to Chad Sanderson
[00:30] Chad's journey to Convoy
[04:25] Evolution of Convoy's platform
[10:33] Definition and measurement of data quality KPIs
[13:36] COVID-19 effect on the distribution data supply chain
[17:15] Justifying investments in ML
[20:00] Examples of data Convoy deals with and models they build
[20:50] Examples of techniques Convoy uses to maintain quality data
[21:00] Concept of a Data Contract
[21:53] Enterprise Data Model
[25:13] Feature store and reuse or use by the business
[28:32] Impact of COVID-19 on the data quality process
[31:54] Software engineers' reactions to the implementation of ideas
[33:21] Other value props
[37:54] Point of a framework to step back from full automation
[41:26] War stories
[45:49] Metrics layer
[50:17] Convoy is hiring!!! Reach out to Chad at https://www.linkedin.com/in/chad-sanderson/ or chad.sanderson@convoy.com

Feb 15, 2022 • 47min
Practitioners Guide to MLOps // Donna Schut and Christos Aniftos // Coffee Sessions #82
MLOps Coffee Sessions #82 with Donna Schut and Christos Aniftos, Practitioners' Guide to MLOps.
// Abstract
The "Practitioners Guide to MLOps" introduced excellent frameworks for how to think about the field. Can we talk about how you've seen the advice in that guide applied to real-world systems? Is there additional advice you'd add to that paper based on what you've seen since its publication and with new tools being introduced?
Your article about selecting the right capabilities has a lot of great advice. It would be fun to walk through a hypothetical company case and talk about how to apply that advice in a real-world setting.
GCP has had a lot of new offerings lately, including Vertex AI. It would be great to talk through what's new and what's coming down the line. Our audience always loves hearing how tool providers like GCP think about the problems customers face and how tools are correspondingly developed.
// Bio
Donna Schut
Donna is a Solutions Manager at Google Cloud, responsible for designing, building, and bringing to market smart analytics and AI solutions globally. She is passionate about pushing the boundaries of our thinking with new technologies and creating solutions that have a positive impact. Previously, she was a Technical Account Manager, overseeing the delivery of large-scale ML projects, and part of the AI Practice, developing tools, processes, and solutions for successful ML adoption. She managed and co-authored Google Cloud's AI Adoption Framework and Practitioners' Guide to MLOps.
Christos Aniftos
Christos is a machine learning engineer with a focus on the end-to-end ML ecosystem. On a typical day, Christos helps Google customers productionize their ML workloads using Google Cloud products and services, with special attention to scalable and maintainable ML environments. Christos made his ML debut in 2010 while working at DigitalMR, where he led a team of data scientists and developers to build a social media monitoring & analytics tool for the Market Research sector.
// Related links
Select the Right MLOps Capabilities for Your ML Use Case: https://cloud.google.com/blog/products/ai-machine-learning/select-the-right-mlops-capabilities-for-your-ml-use-case
Practitioner's Guide to MLOps white paper: https://services.google.com/fh/files/misc/practitioners_guide_to_mlops_whitepaper.pdf
--------------- ✌️Connect With Us ✌️ -------------
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/
Connect with Donna on LinkedIn: https://www.linkedin.com/in/donna-schut/
Connect with Christos on LinkedIn: https://www.linkedin.com/in/aniftos/
Timestamps:
[00:00] Introduction to Donna Schut and Christos Aniftos
[05:52] Inspiration of Practitioner's Guide to MLOps paper
[06:57] Model for working with customers
[08:14] Where are we in MLOps?
[10:20] Working with customers
[11:30] Practitioner's Guide to MLOps paper
[16:16] Training maturity levels
[22:37] Context about the discovery process
[25:21] Disciplines and security
[26:12] Is there a level up in maturity?
[29:50] Successes or failures that stand out
[38:00] War stories
[43:16] Wrap up

Feb 14, 2022 • 49min
Investing in MLOps // Leigh Marie Braswell and Davis Treybig // MLOps Coffee Sessions #81
MLOps Coffee Sessions #81 with Davis Treybig and Leigh Marie Braswell, Machine Learning from the Viewpoint of Investors.
// Abstract
Machine learning is a rapidly evolving space that can be hard to keep track of. Every year, thousands of research papers are published in the space, and hundreds of new companies are built both in applied machine learning and in machine learning tooling.
In this podcast, we interview two investors who focus heavily on machine learning to get their take on the state of the machine learning industry today: Leigh Marie Braswell at Founders Fund and Davis Treybig at Innovation Endeavors. We discuss their perspectives on opportunities within MLOps and applied machine learning, common pitfalls and challenges seen in machine learning startups, and new projects they find exciting and interesting in the space.
// Bio
Davis Treybig
Davis (email: davis@innovationendeavors.com) is currently a principal on the investment team at Innovation Endeavors, an early-stage venture firm focused on highly technical companies. He primarily focuses on software infrastructure, especially data tooling and security. Prior to Innovation Endeavors, Davis was a product manager at Google, where he worked on the Pixel phone and the developer platform for the Google Assistant. Davis studied computer science and electrical engineering in college.
Leigh Marie Braswell
Leigh Marie (Twitter: @LM_Braswell) is an investor at Founders Fund. Before joining Founders Fund, she was an early engineer & the first product manager at Scale AI, where she originally built & later led product development for the LiDAR/3D annotation products, used by many autonomous vehicle, robotics, and AR/VR companies as a core step in their machine learning lifecycles. She has also done software development at Blend, machine learning at Google, and quantitative trading at Jane Street.
--------------- ✌️Connect With Us ✌️ -------------
Connect with Leigh on LinkedIn: https://www.linkedin.com/in/leigh-marie-braswell/
Connect with Davis on LinkedIn: https://www.linkedin.com/in/davistreybig/
Timestamps:
[00:00] Introduction to Leigh Marie Braswell and Davis Treybig
[03:23] Where are we now in MLOps?
[05:50] Ripe for consolidation
[13:08] Real pain to solve
[18:20] Modern data stack vs modern ML stack
[25:25] Strong use cases for ML with a huge sea of long-tail
[28:43] A funny meme with Hugging Face
[32:23] Looking at open-source as an investment
[36:44] Tips and tricks to rally a team and a vision for startups
[43:55] What surprised you over the last year?
[47:16] Societal norms and acceptance of things
[47:55] Where to get in touch with Leigh Marie and Davis: Leigh Marie - Twitter: @LM_Braswell; Davis - email: davis@innovationendeavors.com

Feb 8, 2022 • 42min
The Journey from Data Scientist to MLOps Engineer // Ale Solano // MLOps Coffee Sessions #80
MLOps Coffee Sessions #80 with Ale Solano, The Journey from Data Scientist to MLOps Engineer.
// Abstract
After years of failed POCs, all of a sudden one of our models is accepted and will be used in production. The next morning, we are part of the main scrum stand-up meeting, and a DevOps guy is assisting us. A strange feeling, unknown to us until then, starts growing on the AI team: we are useful!
Deploying models to production is challenging, but MLOps is more than that. MLOps is about making an AI team useful and iterative from the beginning. And it requires a role that takes care of the technical challenges that this implies, given the experimental nature of the ML field, while also serving the product and business needs. If your AI team does not include this role, maybe it's your time to step up and do it yourself! Today, we will chat with Ale about the transition from being a data scientist to a self-called MLOps engineer. And yes, you'll need to study computer science.
// Bio
Ale was born and raised in a mid-sized town near Malaga in southern Spain. Ale did his bachelor's degree in robotics because it sounded cool, and then he got into machine learning because it was even cooler. Ale worked at two companies as an ML developer. Now he's on a temporary hiatus to study business and computer science and get a motivation boost.
--------------- ✌️Connect With Us ✌️ -------------
Connect with Adam on LinkedIn: https://www.linkedin.com/in/aesroka/
Connect with Ale on LinkedIn: https://www.linkedin.com/in/alesolano/
Timestamps:
[00:00] Brief Introduction to Adam Sroka
[00:47] Takeaways
[04:27] Support the community!
[05:37] Introduction to Ale Solano
[06:52] How Ale Solano got into MLOps
[09:16] Getting aboard the ML train
[10:51] Robotics to Computer Science
[14:54] Early MLOps headaches
[16:54] SPRINT by Jake Knapp
[17:58] Starting to implement MLOps
[19:44] Major adjustment
[21:34] Biggest wins
[22:49] Importance of CI/CD
[24:50] Major Stakeholders of Ale's ML team
[26:55] The dream the community must have
[30:33] Recognizing the foundational pieces and the evolution
[33:13] What is Ale excited about, and what is missing
[34:36] Different fields of expertise
[36:49] Ale's take on "80% of the models don't make it to production"
[39:15] Wrap up

Feb 4, 2022 • 52min
Platform Thinking: A Lemonade Case Study // Orr Shilon // MLOps Coffee Sessions #79
MLOps Coffee Sessions #79 with Orr Shilon, Platform Thinking: A Lemonade Case Study.
// Abstract
This episode is the epitome of why people listen to our podcast. It's a complete discussion of the technical, organizational, and cultural challenges of building a high-velocity machine learning platform that impacts core business outcomes. Orr tells us about the focus on automation and platform thinking that's uniquely allowed Lemonade's engineers to make long-term investments that have paid off in terms of efficiency. He tells us the crazy story of how the entire data science team of 20+ people was supported by only 2 ML engineers at one point, demonstrating the leverage their technical strategy has given engineers.
// Bio
Orr is an ML Engineering Team Lead at Lemonade, currently working on an ML Platform, empowering Data Scientists to manage the ML lifecycle from research to development and monitoring. Previously, Orr worked at Twiggle on semantic search, at Varonis on data governance, and at Intel. He holds a B.Sc. in Computer Science and Psychology from Tel Aviv University. Orr also enjoys trail running and sometimes races competitively.
--------------- ✌️Connect With Us ✌️ -------------
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/
Connect with Orr on LinkedIn: https://www.linkedin.com/in/orrshilon/
Timestamps:
[00:00] Takeaways
[05:31] Introduction to Orr Shilon
[06:00] Looking for editors
[07:29] What is Lemonade and what does it do?
[08:43] Machine Learning in Lemonade
[10:06] End-to-end process of ML in Lemonade
[13:00] Recycling features
[14:52] Slack bot, Cooper
[16:15] Importance of automation in Lemonade
[18:11] Circumstances when automation is unnecessary
[19:44] Slack bot platform
[20:31] ML tools used by Lemonade
[23:45] Areas of friction
[26:15] Model serving framework
[28:30] Ownership models
[30:36] Facing challenges
[31:52] Theory about lack of talent
[34:13] Orr Shilon's team
[37:51] Continuation of the building blocks
[39:12] Testing challenges
[42:30] Clear vision of the Lemonade team
[44:46] Demetrios was ghosted by the head of data science at Lemonade
[45:46] Platform thinking
[47:27] What is a "Business point in time"?
[50:24] Wrap up

Jan 31, 2022 • 50min
Calibration for ML at Etsy - apply() special // Erica Greene and Seoyoon Park // MLOps Coffee Sessions #78
MLOps Coffee Sessions #78 with Erica Greene and Seoyoon Park, Calibration for ML at Etsy - apply() special.
// Abstract
This is a special conversation about machine learning calibration at Etsy. Demetrios sat down with Erica Greene and Seoyoon Park to hear how they implemented calibration in the Etsy machine learning workflow.
The conversation is a pre-chat with these two before their presentation at the apply() conference on February 10th. Register here: applyconf.com
// Bio
Erica Greene
Erica is an engineering manager with a background in machine learning. She's passionate about developing programs and policies that support women and other underrepresented groups in technology.
Seoyoon Park
Seoyoon is a backend software engineer and aspiring software architect interested in producing scalable, performant, and fault-tolerant applications by keeping up to date with best practices and industry standards. Seoyoon strives to better himself and his peers by advocating for frequent knowledge transfers and promoting a culture of continuous learning. He is constantly looking for opportunities to grow as a developer and become a leader in the industry.
--------------- ✌️Connect With Us ✌️ -------------
Connect with Erica on LinkedIn: https://www.linkedin.com/in/ericagreene/
Connect with Seoyoon on LinkedIn: https://www.linkedin.com/in/seoyoonpark/
Timestamps:
[00:00] Quick notes
[00:47] Introduction to Erica Greene and Seoyoon Park
[02:37] Looking for an editor
[04:23] Seoyoon Park's background
[06:01] Erica Greene's background
[08:07] Seoyoon's transition to ML
[10:25] Erica's take as team manager
[11:58] Additional points from Seoyoon
[13:17] Early wins in ML
[15:41] Seoyoon's start in ML
[17:54] Three core things for Erica
[19:07] What is calibration?
[22:39] Calibration uses
[24:46] Shift in different models
[26:11] On Calibration of Modern Neural Networks
[26:52] Importance of calibration to models
[28:31] Implementing calibration
[31:24] Calibration metrics
[35:07] Calibration testing
[36:50] The bug encounter
[39:09] Debugging the fault
[43:16] Erica's war story
[46:01] Seoyoon's war story
[48:31] Wrap up

Jan 28, 2022 • 57min
Data Mesh - The Data Quality Control Mechanism for MLOps? // Scott Hirleman // MLOps Coffee Sessions #77
MLOps Coffee Sessions #77 with Scott Hirleman, Data Mesh - The Data Quality Control Mechanism for MLOps?
// Abstract
Scott covers what a data mesh is at a high level for those not familiar. Data mesh is potentially a great win for ML/MLOps as there is very clear guidance on creating useful, clean, well-documented/described, and interoperable data for "unexpected use". So instead of data spelunking being a harrowing task, it can be a very fruitful one. And that one data set that was so awesome? Well, it wasn't a one-off; it's managed as a product with regular refreshes! And there is a LOT more ownership/responsibility on data producers to make sure the downstream doesn't break. Might sound like kumbaya for MLOps (or total BS?) re far cleaner data and fewer upstream breaks, so let's discuss the realities and limitations!
// Bio
A self-professed "chaotic (mostly) good character", Scott is focused on helping the data mesh community accelerate towards finding solutions for some of data management's hardest challenges. He founded the Data Mesh Learning community specifically to gather enough people to exchange ideas, much of which is patterned after the MLOps community. He hosts the Data Mesh Radio podcast, where he dives deep into topics related to data mesh to provide the data community with useful perspectives and thoughts on data mesh.
--------------- ✌️Connect With Us ✌️ -------------
Connect with Adam on LinkedIn: https://www.linkedin.com/in/aesroka/
Connect with Scott on LinkedIn: https://www.linkedin.com/in/scotthirleman/
Timestamps:
[00:00] Takeaways
[04:47] Merchandise
[05:50] What is data mesh?
[08:17] What is a data product?
[11:14] Second layer of data mesh
[13:15] Data standards
[15:51] Third layer of data mesh
[17:13] Cultural aspect of data mesh
[21:56] Data mesh documentation
[24:29] Tooling challenges
[27:55] Data mesh in practice
[31:40] Difference in experiences
[36:05] Baby steps to a fully fledged data mesh
[42:05] How data mesh relates to ML
[48:30] Data mesh vs data mess jokes
[49:02] High risks in data mesh
[52:47] Quick wins
[56:10] Wrap up

Jan 25, 2022 • 51min
Build a Culture of ML Testing and Model Quality // Mohamed Elgendy // MLOps Coffee Sessions #76
MLOps Coffee Sessions #76 with Mohamed Elgendy, Build a Culture of ML Testing and Model Quality.
// Abstract
Machine learning engineers and data scientists spend most of their time testing and validating their models' performance. But as machine learning products become more integral to our daily lives, the importance of rigorously testing model behavior will only increase.
Current ML evaluation techniques are falling short in their attempts to describe the full picture of model performance. Evaluating ML models by only using global metrics (like accuracy or F1 score) produces a low-resolution picture of a model's performance and fails to describe the model's performance across types of cases, attributes, and scenarios.
It is rapidly becoming vital for ML teams to have a full understanding of when and how their models fail and to track these cases across different model versions to be able to identify regression. We've seen great results from teams implementing unit and functional testing techniques in their model testing. In this talk, we'll cover why systematic unit testing is important and how to effectively test ML system behavior.
// Bio
Mohamed is the Co-founder & CEO of Kolena and the author of the book "Deep Learning for Vision Systems". Previously, he built and managed AI/ML organizations at Amazon, Twilio, Rakuten, and Synapse. Mohamed regularly speaks at AI conferences like Amazon's DevCon, O'Reilly's AI conference, and Google's I/O.
--------------- ✌️Connect With Us ✌️ -------------
Connect with Adam on LinkedIn: https://www.linkedin.com/in/aesroka/
Connect with Mohamed on LinkedIn: https://www.linkedin.com/in/moelgendy/
Timestamps:
[00:00] Takeaways
[04:41] Why do ML Testing?
[08:41] Kolena's main goal
[09:41] How ML testing differs from other testing
[13:12] Importance of a knowledge base in the organization
[17:53] Computational cost issues from testing
[20:48] Convincing people to do more testing
[23:13] Testing resources recommendations
[25:15] How to get good at testing
[28:19] Dealing with ML regulations
[30:57] Identifying failure modes
[38:57] Test-centric development for production ML
[40:53] Identifying scenarios
[43:37] Computer vision samples in structured data
[46:10] "Deep Learning for Vision Systems" by Mohamed Elgendy
[49:36] Wrap up
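The abstract's point about global metrics can be illustrated with a small sketch. This is not Kolena's tooling or anything shown in the episode; the function name `accuracy_by_slice`, the slice names, and the data are all hypothetical, chosen only to show how a reasonable-looking global accuracy can hide a scenario where the model fails completely.

```python
from collections import defaultdict


def accuracy_by_slice(labels, predictions, slices):
    """Return (global_accuracy, {slice_name: accuracy}) for non-empty inputs."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for y, y_hat, s in zip(labels, predictions, slices):
        total[s] += 1
        correct[s] += int(y == y_hat)
    global_acc = sum(correct.values()) / sum(total.values())
    per_slice = {s: correct[s] / total[s] for s in total}
    return global_acc, per_slice


# Hypothetical evaluation set: 8 "daylight" cases and 2 "night" cases.
labels      = [1, 1, 0, 0, 1, 0, 1, 0, 1, 1]
predictions = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
slices      = ["daylight"] * 8 + ["night"] * 2

global_acc, per_slice = accuracy_by_slice(labels, predictions, slices)
print(global_acc)  # 0.8
print(per_slice)   # {'daylight': 1.0, 'night': 0.0}
```

Tracking the per-slice numbers across model versions is one simple way to notice a regression that the global number would average away.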

Jan 21, 2022 • 57min
Towards Observability for ML Pipelines // Shreya Shankar // MLOps Coffee Sessions #75
MLOps Coffee Sessions #75 with Shreya Shankar, Towards Observability for ML Pipelines.
// Abstract
Achieving observability in ML pipelines is a mess right now. We are tracking thousands of means, percentiles, and KL divergences of features and outputs in a haphazard attempt to figure out when and how to retrain models.
In this session, we break down current unsuccessful approaches and discuss the path towards effectively maintaining ML models in production. Along the way, we introduce mltrace -- a preliminary open source project striving towards "bolt-on" observability in ML pipelines.
// Bio
Shreya Shankar is a computer scientist living in the Bay Area. She's interested in building systems to operationalize machine learning workflows. Shreya's research focus is on end-to-end observability for ML systems, particularly in the context of heterogeneous stacks of tools. Currently, Shreya is doing her Ph.D. in the RISE lab at UC Berkeley. Previously, she was the first ML engineer at Viaduct, did research at Google Brain, and completed her BS and MS in computer science at Stanford University.
// Related Links
Shreya Shankar's blog posts: https://www.shreya-shankar.com/
Shreya Shankar's Podcasts: https://www.listennotes.com/top-episodes/shreya-shankar/
The deployment phase of machine learning by Benedict Evans: https://www.ben-evans.com/benedictevans/2019/10/4/machine-learning-deployment
Shreya Shankar's mltrace blog post: https://www.shreya-shankar.com/introducing-mltrace/
--------------- ✌️Connect With Us ✌️ -------------
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/
Connect with Shreya on LinkedIn: https://www.linkedin.com/in/shrshnk
Timestamps:
[00:00] Introduction to Shreya Shankar
[01:12] Shreya's background
[03:22] Contrast in scale influence
[05:28] Embedding ML and building machine learning infused products
[07:26] Management structure and professional incentive
[08:25] Organizational side of MLOps retros
[10:15] Tooling implementations
[12:00] Structured rational investment hardships
[13:17] Working at a start-up
[14:02] Academic work and entrepreneurial ambitions
[16:00] ML Monitoring Observability interest
[17:14] Where to get started
[20:47] Realization while at Viaduct
[23:30] Preventing alert fatigue
[27:04] Tooling bridging the gap
[30:40] Juncture at the overall MLOps ecosystem
[33:58] The deployment phase of machine learning - it's the new SQL by Benedict Evans
[35:30] Model monitoring
[36:16] mltrace
[38:28] Introducing the mltrace blog post series
[41:25] Tips to our content creators/writers
[43:47] Monitoring through the lens of the database
[47:37] Advice about picking up ML engineering and ML systems development in 2022
[49:36] Database low down the stack
[50:51] Most excited about 2022
[52:13] What MLOps space/ecosystem should change?
[53:21] Funding has changed the incentives around innovation
[54:52] Competition in million-dollar rounds
[55:25] Starting a company
[56:30] Wrap up
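For readers unfamiliar with the KL divergences mentioned in the abstract, here is a minimal sketch of one such drift statistic for a single feature. It is not mltrace's API and not a method described in the episode; the helper names `binned_probabilities` and `kl_divergence`, the bin edges, the add-one smoothing, and the sample values are all assumptions made for illustration.

```python
import math


def binned_probabilities(values, edges):
    """Bin values by sorted edges and return add-one-smoothed bin probabilities.

    Values outside [edges[0], edges[-1]] are ignored (an assumption of this sketch).
    """
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break
    total = sum(counts) + len(counts)  # +1 per bin so no probability is zero
    return [(c + 1) / total for c in counts]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same bins."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


# Hypothetical feature values at training time and in production.
edges = [0.0, 0.25, 0.5, 0.75, 1.0]
training_values = [0.1, 0.3, 0.6, 0.9]
live_values = [0.8, 0.9, 0.85, 0.95]

reference = binned_probabilities(training_values, edges)
live = binned_probabilities(live_values, edges)
drift = kl_divergence(live, reference)
print(f"KL(live || reference) = {drift:.3f}")
```

Computing a number like this is easy; the abstract's complaint is that doing it for thousands of features, with no principled threshold, does not by itself say when or how to retrain.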

Jan 19, 2022 • 51min
Scaling Biotech // Jesse Johnson // MLOps Coffee Sessions #74
MLOps Coffee Sessions #74 with Jesse Johnson, Scaling Biotech.
// Abstract
Scaling a biotech research platform requires managing organizational complexity - teams, functions, projects - rather than just the traditional volume, velocity, and variety. By examining the processes and experiments that drive the platform, you can focus your work where it matters the most by finding the ideal balance for each type of experiment, along with a number of common trade-offs.
// Bio
Jesse Johnson is head of Data Science and Data Engineering at Dewpoint Therapeutics, an R&D-stage biotech startup. His interest in exploring complex systems, understanding what makes them tick, then using this understanding to improve and scale them led him from academic mathematics into software engineering (Google, Verily Life Sciences), and then to biotech (Sanofi, Cellarity, Dewpoint). His goal is to identify ways to scale biotech research through better software and organizational design.
// Related Links
Jesse's blog posts: scalingbiotech.com
--------------- ✌️Connect With Us ✌️ -------------
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/
Connect with Jesse on LinkedIn: https://www.linkedin.com/in/jesse-johnson-51619a7/
Timestamps:
[00:00] Introduction to Jesse Johnson
[05:10] Jesse's background
[05:52] Biotech environments
[06:31] Jesse's background in Biotech companies
[09:21] Jesse's journey from academia to software engineering
[12:20] Transition from primary output insights/research into writing code
[14:54] Actual hands-on use case in practice
[19:19] Jesse's career trajectory
[23:57] Where we're at, state-of-the-art data engineering and its outstanding challenges
[26:50] Dewpoint's data and machine learning challenges and tooling
[29:04] Dewpoint's team structure
[30:20] Jesse, the VP of Data Science and Data Engineering
[33:24] New biotech data makes it hard to design a data platform
[35:35] Changes in how biotech data is viewed
[35:54] Experiment data output
[40:19] Solving challenges in structuring real-world context into interpretable data fields
[44:16] Maturity between the current data engineering and MLOps tooling space
[47:31] Achieving a blogpost mission in 2022
[49:50] Wrap up


