Data Software Engineering Daily

Sinopsis

Databases and data engineering episodes of Software Engineering Daily

Episodios

  • LinkedIn Data Platform with Carl Steinbach

    LinkedIn Data Platform with Carl Steinbach

    23/10/2019 Duración: 52min

    LinkedIn is a social network with petabytes of data.  In order to store that data, LinkedIn distributes and replicates that data across a large cluster of machines running the Hadoop Distributed File System. In order to run calculations across its large data set, LinkedIn needs to split the computation up using MapReduce-style jobs. LinkedIn has The post LinkedIn Data Platform with Carl Steinbach appeared first on Software Engineering Daily.

  • Incident Reproduction with Tammy Butow

    Incident Reproduction with Tammy Butow

    16/10/2019 Duración: 01h01min

    Databases go offline. Services fail to scale up. Deployment errors can cause an application backend to get DDoS’d. When an event happens that prevents your company from operating as expected, it is known as an incident. Software teams respond to an incident by issuing a fix. Sometimes that fix returns the software to its ideal The post Incident Reproduction with Tammy Butow appeared first on Software Engineering Daily.

  • Diffbot Infrastructure with Mike Tung

    Diffbot Infrastructure with Mike Tung

    15/10/2019 Duración: 01h06min

    Diffbot is a knowledge graph that allows developers to interface with the unstructured web as if it was a structured database. In today’s show, Diffbot CEO Mike Tung returns for a second discussion about how he has built Diffbot and how Diffbot is used. The web has many different entities. Web pages, topics, people, stories, The post Diffbot Infrastructure with Mike Tung appeared first on Software Engineering Daily.

  • Data Visualization with Sherman Wood and Chad Lumley

    Data Visualization with Sherman Wood and Chad Lumley

    10/10/2019 Duración: 51min

    Data visualization is the presentation of data in a way that emphasizes certain qualities about that data. Data visualization can be used to prove a specific point, or it can be used as a depiction of a data set to be explored. Data visualization is used in consumer software products as well as backend engineering The post Data Visualization with Sherman Wood and Chad Lumley appeared first on Software Engineering Daily.

  • GDPR in Practice with Joshua Prismon

    GDPR in Practice with Joshua Prismon

    07/10/2019 Duración: 52min

    Data privacy policies have changed how software organizations need to operate. As consumer preferences have shifted in favor of strong privacy, software companies are having to examine their policies around data collection and retention. Many software companies were started in a time with different norms around data.  Building a new application that is compliant with The post GDPR in Practice with Joshua Prismon appeared first on Software Engineering Daily.

  • Fivetran: Data Connectors with George Fraser

    Fivetran: Data Connectors with George Fraser

    02/10/2019 Duración: 51min

    Large companies have multiple databases, multiple data formats, and multiple applications that need to use the data. Every data engineer needs to move data between these different components of a system. Moving data between different parts of a system is often called “ETL”, an acronym for “Extract, Transform, Load.” Data engineers spend much of their The post Fivetran: Data Connectors with George Fraser appeared first on Software Engineering Daily.

  • Life Insurance Engineering with Vipul Sharma and Lingke Wang

    Life Insurance Engineering with Vipul Sharma and Lingke Wang

    26/09/2019 Duración: 59min

    Ethos Life Insurance is a software company that sells life insurance products. Software is reshaping established industries such as banking, insurance, and manufacturing. In these large, established industries, incumbents are adopting new technology as fast as they can, but the new technology needs to be integrated with the old technology. The slow rate of technology The post Life Insurance Engineering with Vipul Sharma and Lingke Wang appeared first on Software Engineering Daily.

  • Kafka Data Pipelines with Robin Moffatt

    Kafka Data Pipelines with Robin Moffatt

    23/09/2019 Duración: 54min

    A new software product usually starts with a single database. That database manages the tables for user accounts and basic transactions. When a product becomes popular, the database grows in size. There are more transactions and more users. A company grows around that product, and the company starts to accumulate more data in different sources. The post Kafka Data Pipelines with Robin Moffatt appeared first on Software Engineering Daily.

  • ReadMe with Greg Koberger

    ReadMe with Greg Koberger

    20/09/2019 Duración: 01h22min

    A software company needs to get many things right in order to be successful. Having a useful product with solid engineering is only the beginning. ReadMe was started five years ago. The company solved a seemingly simple problem: documentation for software products. If you have worked as a software engineer, you have looked at documentation. The post ReadMe with Greg Koberger appeared first on Software Engineering Daily.

  • Distributed Databases with Aly Cabral

    Distributed Databases with Aly Cabral

    18/09/2019 Duración: 57min

    Modern databases consist of multiple servers that host the data in a distributed fashion. Using multiple servers allows a database to be resilient to the failure of any one database node, because copies of the data are shared to other servers. A multi-node setup also lets the database grow beyond the size of data that The post Distributed Databases with Aly Cabral appeared first on Software Engineering Daily.

  • Talking Python with Michael Kennedy

    Talking Python with Michael Kennedy

    12/09/2019 Duración: 56min

    Python is one of the most popular programming languages in the software world. After working with Python and developing a love for the language, Michael Kennedy started to wonder why there was not a high quality podcast dedicated to covering the community and new technologies of the Python ecosystem. Michael started Talk Python To Me The post Talking Python with Michael Kennedy appeared first on Software Engineering Daily.

  • MongoDB Data Platform with Andrew Davidson

    MongoDB Data Platform with Andrew Davidson

    11/09/2019 Duración: 52min

    A new software application has simple requirements for a database.  The database needs to be written to and read from. The database fulfills simple needs such as storing user information and providing the application frontend with the necessary data to render a simple webpage of financial transactions or blog posts. As an application becomes successful, The post MongoDB Data Platform with Andrew Davidson appeared first on Software Engineering Daily.

  • Google Spanner with Deepti Srivastava

    Google Spanner with Deepti Srivastava

    10/09/2019 Duración: 57min

    Spanner is a globally distributed, transactionally consistent database.  Spanner initially emerged as a paper that came out of Google in 2012. Around this time, database scalability was difficult to solve, even for Google. The Spanner paper offered some breakthroughs in distributed systems which allowed Google to take some of the learnings from BigTable’s eventual consistency The post Google Spanner with Deepti Srivastava appeared first on Software Engineering Daily.

  • Monday: Business Management Software with Eran Zinman

    Monday: Business Management Software with Eran Zinman

    05/09/2019 Duración: 57min

    Modern software is built with cloud services, APIs, and other high level tools. Technical software development is moving beyond the realm of writing code. Individuals who do not have a background in computer science or programming can create increasingly complex tools. Higher level APIs include Twilio for managing phone communications, and Stripe for managing financial The post Monday: Business Management Software with Eran Zinman appeared first on Software Engineering Daily.

  • Cassandra Business with Jonathan Ellis

    Cassandra Business with Jonathan Ellis

    04/09/2019 Duración: 54min

    Cassandra was initially released in 2008 as a project out of Facebook. Cassandra offered an open source solution to database scalability issues that were being tackled internally by large companies like Amazon, Google and Facebook. 2008 was a golden age of new infrastructure, with systems such as Hadoop and Kafka gaining traction around the same The post Cassandra Business with Jonathan Ellis appeared first on Software Engineering Daily.

  • DevOps at Delta Air Lines with Jasmine James

    DevOps at Delta Air Lines with Jasmine James

    03/09/2019 Duración: 43min

    Airlines have always had an emphasis on new technology. Over the years, airlines have needed to develop more and more software. Digital transformation is causing every large company to adopt the tools and practices of software companies, and that includes Delta Airlines. Delta Airlines has existed for more than 90 years. Over that period of The post DevOps at Delta Air Lines with Jasmine James appeared first on Software Engineering Daily.

  • Facebook Scaling with Pedram Keyani

    Facebook Scaling with Pedram Keyani

    26/08/2019 Duración: 58min

    Facebook is a large multiuser application. Scaling Facebook was different than scaling a single-user application such as an ecommerce store or a search engine. A social network is faced with unique infrastructure scalability challenges, as well as subjective questions around user communications, privacy, and content. Pedram Keyani worked at Google before joining Facebook in 2007. The post Facebook Scaling with Pedram Keyani appeared first on Software Engineering Daily.

  • Time Series Databases with Rob Skillington

    Time Series Databases with Rob Skillington

    21/08/2019 Duración: 58min

    A time series database is optimized for the storage of high volumes of sequential data across time. Time series databases are often organized as columnar data stores that can write large volumes of data quickly. These systems can sometimes tolerate data loss, because the data they are gathering is used for monitoring and other applications The post Time Series Databases with Rob Skillington appeared first on Software Engineering Daily.

  • Data-As-A-Service with Auren Hoffman

    Data-As-A-Service with Auren Hoffman

    08/08/2019 Duración: 01h02min

    Data-as-a-service businesses offer paid access to data sets. These data sets can be useful for building products or training machine learning models.  There has been steady growth in the tools and practices around processing and storing data. But access to data sets remains a bottleneck for widespread development of machine learning applications in a large The post Data-As-A-Service with Auren Hoffman appeared first on Software Engineering Daily.

  • Uber’s Data Platform with Zhenxiao Luo

    Uber’s Data Platform with Zhenxiao Luo

    24/05/2018 Duración: 01h02min

    When a user takes a ride on Uber, the app on the user’s phone is communicating with Uber’s backend infrastructure, which is writing to a database that maintains the state of that user’s activity. This database is known as a transactional database or “OLTP” (online transaction processing). Every active user and driver and UberEATS restaurant The post Uber’s Data Platform with Zhenxiao Luo appeared first on Software Engineering Daily.

página 5 de 5

Informações: