Data Software Engineering Daily

Informações:

Sinopsis

Databases and data engineering episodes of Software Engineering Daily

Episodios

  • Low-Code SQL on dbt Core with Raj Bains from Prophecy

    26/05/2023 Duración: 54min

    In this podcast episode, we take a look at the intricacies of low-code data pipelines with Raj Bains, the founder of Prophecy.io. Raj shares valuable insights into how performant low-codedata pipelines are revolutionizing industries and transforming everyday operations. Raj discusses the founding story of Prophecy.io, the company’s mission, and its approach to democratizing the creation The post Low-Code SQL on dbt Core with Raj Bains from Prophecy appeared first on Software Engineering Daily.

  • Open-Source Embedding Database with Anton Troynikov

    20/04/2023 Duración: 32min

    Chroma is an open source embedding database that is designed to make it easy to build large language model applications by making knowledge, facts and skills pluggable. Anton Troynikov is the co-founder of Chroma and he is our guest today. This episode is hosted by Lee Atchison. Lee Atchison is a software architect, author, and The post Open-Source Embedding Database with Anton Troynikov appeared first on Software Engineering Daily.

  • Data Activation with Tejas Manohar

    13/04/2023 Duración: 41min

    Data Activation is the method of unlocking the knowledge sorted within your data warehouse, and making it actionable by your business users in the end tools that they use every day. In doing so, Data Activation helps bring data people toward the center of the business, directly tying their work to business outcomes. Hightouch is The post Data Activation with Tejas Manohar appeared first on Software Engineering Daily.

  • Self-Service Data Culture with Stemma’s Mark Grover

    07/04/2023 Duración: 46min

    A data catalog provides an index into the data sets and schemas of a company.Data teams are growing in size, and more companies than ever have a data team, so the market for data catalog is larger than ever. Mark is the CEO of Stemma and the co-creator of Amundsen, a data catalog that came The post Self-Service Data Culture with Stemma’s Mark Grover appeared first on Software Engineering Daily.

  • Streaming Analytics with Hojjat Jafarpour

    06/04/2023 Duración: 46min

    Streaming analytics refers to the process of analyzing real-time data that is generated continuously and rapidly from various sources, such as sensors, applications, social media, and other internet-connected devices. Streaming analytics platforms enable organizations to extract business value from data in motion, similar to how traditional analytics tools derive insights from data at rest. DeltaStream The post Streaming Analytics with Hojjat Jafarpour appeared first on Software Engineering Daily.

  • Turso: Globally Replicated SQLite with Glauber Costa

    03/04/2023 Duración: 56min

    Distributed databases are necessary for storing and managing data across multiple nodes in a network. They provide scalability, fault tolerance, improved performance, and cost savings. By distributing data across nodes, they allow for efficient processing of large amounts of data and redundancy against failures. They can also be used to store data across multiple locations The post Turso: Globally Replicated SQLite with Glauber Costa appeared first on Software Engineering Daily.

  • Observability Trends with John Hart

    20/03/2023 Duración: 31min

    DataSet is a log analytics platform provided by Sentinel One that helps DevOps, IT engineering, and security teams get answers from their data across all time periods, both live streaming and historical. It’s powered by a unique architecture that uses a massively parallel query engine to provide actionable insights from the data available. John Hart The post Observability Trends with John Hart appeared first on Software Engineering Daily.

  • Data Investing and the MAD with Matt Turck

    10/03/2023 Duración: 51min

    There are many types of early stage funding available from friends and family to seed to series A.  Some firms invest across a wide set of technologies and seek only to provide capital. Others are in it for the long haul – they focus on specific areas of technology and develop both long term relationships The post Data Investing and the MAD with Matt Turck appeared first on Software Engineering Daily.

  • Accessing Data at Scale with Justin Borgman

    11/11/2022 Duración: 53min

    The Presto/Trino project makes distributed querying easier across a variety of data sources. As the need for machine learning and other high volume data applications has increased, the need for support, tooling, and cloud infrastructure for Presto/Trino has increased with it. Starburst helps your teams run fast queries on any data source. With Starburst you The post Accessing Data at Scale with Justin Borgman appeared first on Software Engineering Daily.

  • Building on the Data Cloud with Torsten Grabs

    07/11/2022 Duración: 47min

    Building and managing data-intensive applications has traditionally been costly and complex, and has placed an operational burden on developers to maintain as their organization scales. Todays’ developers, data scientists, and data engineers need a streamlined, single cloud data platform for building applications, pipelines, and machine learning models — without having to move or copy their The post Building on the Data Cloud with Torsten Grabs appeared first on Software Engineering Daily.

  • Tinybird with Jorge Sancha

    12/09/2022 Duración: 41min

    Data analytics technology and tools have seen significant improvements in the past decade. But, it can still take weeks to prototype, build and deploy new transformations and deployments, usually requiring considerable engineering resources. Plus, most data isn’t real-time. Instead, most of it is still batch-processed. Tinybird Analytics provides an easy way to ingest and query The post Tinybird with Jorge Sancha appeared first on Software Engineering Daily.

  • Data Infrastructure for Finance

    18/08/2022 Duración: 54min

    Data is becoming a bank’s biggest asset. These complex enterprises have a huge opportunity ahead – to transform themselves to become a trusted hub of a much broader data ecosystem that goes beyond the financial industry and helps to form a new class of cross-industry experience architectures that are scalable and transparent. The data physics The post Data Infrastructure for Finance appeared first on Software Engineering Daily.

  • Faking Data Using Tonic.ai with Ian Coe and Adam Kamor

    05/08/2022 Duración: 41min

    Ian Coe CEO Adam Kamor Head of Engineering Companies that gather data about their users have an ethical obligation and legal responsibility to protect the personally identifiable information in their dataset.  Ideally, developers working on a software application wouldn’t need access to production data. Yet without high-quality example data, many technology groups stumble on avoidable The post Faking Data Using Tonic.ai with Ian Coe and Adam Kamor appeared first on Software Engineering Daily.

  • Couchbase with Ravi Mayuram

    28/07/2022 Duración: 37min

    Couchbase is a distributed NoSQL cloud database. Since its creation, Couchbase has expanded into edge computing, application services, and most recently, a database-as-a-service called Capella.  Couchbase started as an in-memory cache and needed to be rearchitected to be a persistent storage system. In this episode, We interviewed Ravi Mayuram, SVP Products, and Engineering at Couchbase. The post Couchbase with Ravi Mayuram appeared first on Software Engineering Daily.

  • Decodable Streaming with Eric Sammer

    01/06/2022 Duración: 44min

    Streaming data platforms like Kafka, Pulsar, and Kinesis are now common in mainstream enterprise architectures, providing low-latency real-time messaging for analytics and applications. However, stream processing – the act of filtering, transforming, or analyzing the data inside the messages – is still an exercise left to the receiving microservice or datastore, a custom programming exercise The post Decodable Streaming with Eric Sammer appeared first on Software Engineering Daily.

  • Data Delivery with Naqeeb Memon

    14/05/2022 Duración: 28min

      Data-as-a-service is a company category type that is not as common as API-as-a-service, software-as-a-service, or platform-as-a-service. In order to vend data, a data-as-a-service provider needs to define how that data will be priced, stored, and delivered to users: streaming over an API or served via static files. Naqeeb Memon of Safegraph joins the show The post Data Delivery with Naqeeb Memon appeared first on Software Engineering Daily.

  • Data Labeling with Michael Malyuk

    11/05/2022 Duración: 41min

    Data labeling allows machine learning algorithms to find patterns among the data. There are a variety of data labeling platforms that enable humans to apply labels to this data and ready it for algorithms. Heartex is a data labeling platform with an open source core. Michael Malyuk joins the show to talk through the platform The post Data Labeling with Michael Malyuk appeared first on Software Engineering Daily.

  • Pinot and StarTree with Chinmay Soman

    09/05/2022 Duración: 44min

    Real-time analytics are difficult to achieve because large amounts of data must be integrated into a data set as that data streams in. As the world moved from batch analytics powered by Hadoop into a norm of “real-time” analytics, a variety of open source systems emerged. One of these was Apache Pinot. StarTree is a The post Pinot and StarTree with Chinmay Soman appeared first on Software Engineering Daily.

  • Data Loss Prevention with Yasir Ali

    29/04/2022 Duración: 40min

    Data loss can occur when large data sources such as Slack or Google Drive get leaked. In order to detect and avoid leaks, a data asset graph can be built to understand the risks of a company environment. Polymer is a data loss prevention product that helps companies avoid problematic data leaks. Yasir Ali is The post Data Loss Prevention with Yasir Ali appeared first on Software Engineering Daily.

  • Airbyte Engineering with Michel Tricot

    27/04/2022 Duración: 42min

    Data integration infrastructure is not easy to build. Moving large amounts of data from one place to another has historically required developers to build ad hoc integration points to move data between SaaS services, data lakes, and data warehouses. Today, there are dedicated systems and services for moving these large batches of data. Airbyte builds The post Airbyte Engineering with Michel Tricot appeared first on Software Engineering Daily.

página 2 de 5