Data Software Engineering Daily

Information:

Synopsis

Databases and data engineering episodes of Software Engineering Daily

Episodes

  • Observability Using Honeycomb.io with Christine Yen

    08/11/2021 Duration: 49min

    It does not matter if it runs on your machine. Your code must run in the production environment and it must do so performantly. For that, you need tooling to better understand your application’s behavior under different circumstances. In the earliest days of software development, all we had were logs, which are still around and …

  • Location-Based Experiences Using Foursquare with Ankit Patel

    03/11/2021 Duration: 48min

    The manner in which users interact with technology has rapidly switched to mobile consumption. The devices almost all of us carry with us at all times open endless opportunities for developers to create location-based experiences. Foursquare became a household name when they introduced social check-ins. Today they’re a location data platform. Ankit Patel is the …

  • Datadog with Omri Sass and Hugo Kaczmarek

    28/10/2021 Duration: 43min

    Modern business applications are complex. It’s not enough to have raw logs or some basic telemetry. Today’s enterprise organizations require an application performance monitoring (APM) solution. Today’s applications are complex distributed systems whose performance depends on a wide variety of factors. Every single line of code can affect production and teams need insights into …

  • Infrastructure as Code with Christian Tragesser

    08/10/2021 Duration: 43min

    Infrastructure as Code is an approach to machine provisioning and setup in which a programmer describes the underlying services they need for their projects. However, this infrastructure code doesn’t compile to a binary artifact like traditional source code. The successful completion of running the code signals that the servers and other components described in the configuration …

  • Modern Data Infrastructure and Tools with Leigh Marie Braswell

    05/10/2021 Duration: 47min

    The first industrial deployments of machine learning and artificial intelligence solutions were bespoke by definition and often had brittle operating characteristics. Almost no one builds custom databases, web servers, or email clients. Yet technology groups today often consider developing homegrown ML and data solutions in order to solve their unique use cases. Today’s modern data …

  • Git Scales for Monorepos with Derrick Stolee

    01/10/2021 Duration: 53min

    A monorepo is a version control management strategy in which all your code is contained in one potentially large but complete repository. The monorepo is in stark contrast to an alternative approach in which software teams independently manage microservices or deliver software as libraries to be imported in other projects.

  • Faking Data Using Tonic.ai with Ian Coe and Adam Kamor

    29/09/2021 Duration: 50min

    Companies that gather data about their users have an ethical obligation and legal responsibility to protect the personally identifiable information in their dataset. Ideally, developers working on a software application wouldn’t need access to production data. Yet without high-quality example data, many technology groups stumble on avoidable problems. Organizations need a solution to protect privacy …

  • DBT: Data Build Tool with Tristan Handy

    28/09/2021 Duration: 44min

    Applications write data to persistent storage like a database. The most popular database query language is SQL, which has many similar dialects. SQL is expressive and powerful for describing what data you want. What you do with that data requires a solution in the form of a data pipeline. Ideally, these analytical workflows can follow …

  • No Code Process Automation at Axiom with Yaseer Sheriff

    24/09/2021 Duration: 43min

    Tedious, repetitive tasks are better handled by machines. Unless these tasks truly require human intelligence, repetitive tasks are often good candidates for automation. Implementing process automation can be challenging and technical. Increasingly, engineers are seeking out tools and platforms to facilitate faster, more reliable automation. In this episode I talk to Yaseer Sheriff, Co-Founder and …

  • LinearB with Dan Lines

    21/09/2021 Duration: 45min

    A developer’s core deliverables are individual commits and the pull requests they aggregate into. While the number of lines of code written alone may not be very informative, in total, the code and metadata about the code found in tracking systems present a rich dataset with great promise for analysis and productivity optimization insights. LinearB …

  • Modern Data Stacks Optimized by Mozart Data with Peter Fishman and Dan Silberman

    14/09/2021 Duration: 50min

    Modern companies leverage dozens or even hundreds of software solutions to solve specific needs of the business. Organizations need to collect all these disparate data sources into a data warehouse in order to add value. The raw data typically needs transformation before it can be analyzed. In many cases, companies develop homegrown solutions, thus reinventing …

  • Instabase with Anant Bhardwaj

    07/09/2021 Duration: 48min

    Instabase is a technology platform for building automation solutions. Users deploy it onto their own infrastructure and can leverage the tools offered by the platform to build complex workflows for handling tasks like income verification and claims processing. In this episode we interview Anant Bhardwaj, founder of Instabase. He describes Instabase as an operating system.

  • InfluxData: Time-Series Data with Russ Savage

    19/08/2021 Duration: 43min

    Time series data are simply measurements or events that are tracked, monitored, downsampled, and aggregated over time. This could be server metrics, application performance monitoring, network data, sensor data, events, clicks, trades in a market, and many other types of analytics data (influxdata.com). The platform InfluxData is designed for building and operating time series applications.

  • Druid: Event-Driven Data with Eric Tschetter

    16/08/2021 Duration: 56min

    Whether sending messages, shopping in an app, or watching videos, modern consumers expect information and responsiveness to be near-instant in their apps and devices. From a developer’s perspective, this means clean code and a fast database. Apache Druid is a database built to power real-time analytic workloads for event-driven data, like user-facing applications, streaming, and …

  • DaaS with Auren Hoffman

    13/08/2021 Duration: 01h47min

    Auren Hoffman is the CEO of SafeGraph. In this episode we discuss data as a service and more. This interview was also recorded as a video podcast. Check out the video on the Software Daily YouTube channel.

  • Reverse ETL: Operationalizing Data Warehouses with Tejas Manohar

    02/08/2021 Duration: 01h02min

    Enterprise data warehouses store all company data in a single place to be accessed, queried, and analyzed. They’re essential for business operations because they support managing data from multiple sources, provide context, and have built-in analytics tools. While keeping a single source of truth is important, easily moving data from the warehouse to other applications …

  • Prophecy: Apple of Data Engineering with Raj Bains

    28/07/2021 Duration: 58min

    Prophecy is a complete Low-Code Data Engineering Platform for the Enterprise. Prophecy enables all your teams on Apache Spark with a unique low-code designer. While you visually build your Dataflows, Prophecy generates high-quality Spark code on Git. Then, you can schedule Spark workflows with Prophecy’s low-code Airflow. Not only that, Prophecy provides end-to-end visibility …

  • Pulsar Rerevisted with Enrico Olivelli

    26/07/2021 Duration: 56min

    In the previous episode, Pulsar Revisited, we discussed how the company DataStax has added to their product stack Astra Streaming, their cloud-native messaging and event streaming service that’s built on top of Apache Pulsar. We discussed Apache Pulsar and the added features DataStax offers like injecting machine learning into your data streams and viewing real-time …

  • CockroachDB: Distributed Databases and Containerization with Spencer Kimball

    21/07/2021 Duration: 58min

    In 2003, Google developed a robust cluster management system called Borg. This enabled them to manage clusters with tens of thousands of machines, moving them away from virtual machines and firmly into container management. Then, in 2014, they open sourced Kubernetes, or K8s, a system inspired by Borg. Now, in 2021, CockroachDB is a distributed …

  • Imply Infra: Big Data Analysis and Real-World Examples with Jad Naous

    19/07/2021 Duration: 38min

    Big data analytics is the process of collecting data, processing and cleaning it, then analyzing it with techniques like data mining, predictive analytics, and deep learning. This process requires a suite of tools to operate efficiently. Data analytics can save companies money, drive product development, and give insight into the market and customers. The company …
