"Agile practices can bring discipline to data science through support for the values and principles of DataOps. However, just as there is no perfect agile framework or set of practices for software development, there is no single set of best agile practices for data science. The right practices to use are context and organization specific and help data analytics teams become more adaptable and collaborative and tighten feedback loops to produce faster (and better) results. The successful application of agile and Lean thinking to data analytics requires observation, constant experimentation, and adjustment." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"DataOps does not prescribe a particular agile framework, set of practices, artifacts, or roles. Nonetheless, instead of inventing new ways of working, it is better to adapt existing proven practices and combining them as needed. The following are useful agile practices aligned with DataOps principles. However, none are compulsory, and if you find they do not add value, do not use them. Agile frameworks are not mutually exclusive, and practices are situation and context-dependent. You can explore alternatives as long as you stay true to DataOps values and principles." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"DataOps is not a project. The first iteration of DataOps is not the last. With minimum viable DataOps in place and benefits publicized, the next stage is to expand practices to more domains and epics. The second objective is to get to a tipping point where it becomes more compelling to continue the journey of implementing DataOps practices, principles, and values than to resist them. Results speak louder than words. It remains essential to avoid diversions into time-consuming political battles with hard-to-change departments or to waste time developing a large-scale change management program." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"DataOps methodology is the best way to eliminate barriers, collaborate, and maximize the chances of success. DataOps turns data science and analytics from the craft industry it is today in most organizations into a slick manufacturing operation. DataOps enables rapid data product development and creates an assembly line that converts raw data from multiple sources into production data products with a minimum of waste. (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"DataOps requires that data scientists, data analysts, and data engineers have quick access to data, tools, and infrastructure to eliminate bottlenecks. That is, they need to be able to access, add, or modify data quickly by themselves. We term this availability to data self-service. Through self-service, data analytics professionals can create data products in far less time than with traditional approaches." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"DataOps should be part of a well-thought-out data strategy that lays the foundation for a transformation. Actually, all organizations that want to use data for data-sharing or analytical purposes need a data strategy. The only variation will be the depth of strategy and complexity of use cases. A start-up’s data strategy might not need the same detail and span as a multinational corporation, but it should still define a means to prepare for the future." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"DevOps effectiveness increases when there is less heterogeneity in the technology stack. Complexity increases the probability of errors and slows down the flow of deployment because teams find it hard to scale their expertise and apply consistent patterns across data pipelines. The focus of data analytics teams after adopting version control should be to standardize and simplify the set of technologies they use from languages, libraries, and databases to data engineering tools." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"The final stage of DevOps for DataOps is to automate the build of pipeline environments and give data pipeline developers self-serve ability to create, test, and deploy changes." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"Unless dealing with a greenfield situation, it is not possible to jump straight to the end state of all analytical work following the DataOps methodology. Respecting the principles of agile and DevOps, the movement to the end goal must be in iterative, small, and frequent steps." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)
"While there are undoubtedly success stories, there is also plenty of evidence that substantial investment in data science is not generating the returns expected for a majority of organizations." (Harvinder Atwal, "Practical DataOps: Delivering Agile Data Science at Scale", 2020)