Bob Salmon
Tech Lead
I am Open to Mentor, Speak, Write, Podcasting
I'm a programmer who likes people, code and data. That means I think that things like quality and user experience are important too.
Achievements
Certificates
Level up your software testing and quality engineering skills with the credibility of a Ministry of Testing certification.
Activity
is Open to Podcasting
is Open to Write
is Open to Speak
achieved:
This badge is awarded to members who indicate they are open to work, writing, teaching, speaking, mentoring, CV reviewing or internships.
is Open to Mentor
Contributions
Learn how focusing on user value and trust gives you a clearer, more effective way to test data quality
A data contract is a document that defines the ownership, structure, semantics, quality, and terms of use for exchanging data between a data producer and their consumers. It is human- and machine-readable, and so can be used as both a communication tool between teams and a way to automatically detect when expectations about data are broken.
Write-Audit-Publish (WAP) is a pattern for designing data pipelines where a pipeline is built up of several sections. Each section produces a result data set that is used by one or more downstream sections and conforms to the same three-stage process:
Write: The main work of the section is done and data is written to a staging area that is inaccessible to other sections
Audit: The staging data is checked using automated checks
Publish: Only data that passes the audit is published to downstream sections of the pipeline
Medallion data architecture is a way of splitting the ingestion and processing of data into three stages: bronze, silver, and gold.
In the bronze stage, ingested data is stored in its unaltered form.
The silver stage attempts to fix problems in the bronze data and augment it by linking it with other data, producing a more usable version.
The gold stage takes the silver data and summarises it, along with any other processing needed to make the data ready for consumption by downstream processes.
YAML is a way of expressing structured data that is both machine-readable and human-readable, similar to JSON. It is often used for configuration and contracts that need to be understood and maintained by people as well as systems.
Data quality is weird - Bob Salmon
Bridge the gap between developers, testers, and data teams to create stronger, people-centred quality practices.
Tasked with reviewing a large requirements document? Bob Salmon has you covered with his handy tips for reviewing requirements documents
Bob Salmon solves the Data Management Challenge from Test.bash(); 2022 using SQLServer Studio and Visual Studio.