Explore all our content

Dive into our back catalogue of content. With over 2000 pieces of content to digest, there will definitely be some software testing content of interest to you.

Displaying contents 691 - 720 of 4607 in total

Searching...

Me, a white woman with medium length brown hair in a plait, looking very excited in front of my black and blue Yamaha motorbike, in a green Welsh landscape, near a large tree and a river
I'm coming from the Netherlands I'm a quality engineer for Dutch Railways I'm coming to TestBash as a host! I'm super interested in and w...
Why 'Human in the Loop' isn’t enough. Introducing Quality in the Loop (QITL) image
Building trustworthy AI requires a partnership between the two
Four people represented in a simple cartoonish way collaborating around a shared board, learning together
You've heard the cliché, it's not what you know, it's who you know. When people see you as a leader and look to you for answers, you can'...
 A group of five smiling women stands behind a large, rustic wooden structure at night. The women are looking towards the camera. The woman on the far right rests her hand on the wooden structure. A duck, a seagull and a bug character have been added to the image.
Can you find Bug, Space Duck and Space Seagull?
Testbash 2025 profile for Gary Hawkes including an Image
About me: I'm coming from: Bury St Edmunds, Suffolk, UK My role is: QA Lead I'd love to meet others who are into: Leadership and anyone i...
A photo of me (Tom Game).  I am a white, middle-aged man with a bald head and beard.  I am wearing glass and a black t-shirt with a blue logo say "made in the 90s".
About Me: I’m coming from: Cambridge, UK My role is: Quality and Test Engineer I’d love to meet others who are into: Testing AI systems |...
A screenshot from the paper: The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Showing a composite figure illustrating how reasoning models solve the Tower of Hanoi problem, and how their performance varies with problem complexity.

Top Section – LLM Response Workflow:

On the left is a code-like LLM response showing a  section with a list of disk moves (e.g., [1, 0, 2], [2, 0, 1], etc.) and a  section referencing the final moves list. Arrows indicate:

Moves are extracted from the  section for analysis.

Final answer is extracted from the  section for measuring accuracy.

To the right, a sequence of three Tower of Hanoi diagrams represents:

Initial State: All disks stacked on peg 0.

Middle State: Disks distributed across pegs.

Target State: All disks correctly stacked on peg 2.
Each disk is color-coded and numbered for clarity.

Bottom Row – Three Line Graphs:

Left Graph: Accuracy vs. Complexity

Y-axis: Accuracy (%)

X-axis: Problem complexity (number of disks, from 1 to 20)

Two lines: Claude 3.7 (red circles) and Claude 3.7 with “thinking” mode (blue triangles).

Accuracy drops sharply for both as disk number increases, with “thinking” performing slightly better up to 8 disks.

Middle Graph: Response Length vs. Complexity

Y-axis: Token count

X-axis: Number of disks

“Thinking” responses grow rapidly in length with complexity, peaking near 8 disks.

Right Graph: Position of Error in Thought Process

Y-axis: Normalized position in the LLM’s reasoning (0 to 1)

X-axis: Complexity (1 to 15 disks)

Shows where correct vs. incorrect reasoning paths diverge; incorrect solutions typically fail earlier in the thoughts.

Background colors across all graphs denote complexity bands: yellow (easy), blue (moderate), red (hard).
Apple just tested the smartest "reasoning" AI Models out there: Claude 3.7 Sonnet, DeepSeek-R1, OpenAI’s o1/o3. The verdict? They didn’t...
Thumbs up from Rahul Parwal
A few things I’m proud of: 🧪 I break things for a living (so they don’t break in the wild). 📍 Based in Jaipur, India’s Pink City, where ...
A white woman in their 40s having a shoulder length dark blonde hear and wearing light blue jeans jacket.
Nataliia Burmei
Nataliia Burmei
About Me: I’m coming from: Surrey/London, England My role is: Lead Quality Engineer I’d love to meet others who are into: quality engi...
Follow you, follow me, quality engineering community: MoT Weekly – Issue 520 image
  • Simon Tomes's profile
What impact can following people in the MoTaverse have on your career and those around you? Read a roundup of news, events and actionable ideas in this week's MoT Weekly.
A man standing in a bookstore wearing a black t-shirt with white writing: test shirt please ignore. His face is not included in the original photo to protect his identity.
You can imagine my delight when I spotted this bloke in my local Waterstones bookstore. "I'm a tester, can I get a photo of your t-shirt...
A starry TestBash Brighton 2025 beach scene features beach huts filled with familiar MoTaverse characters. There’s a bug in a top hat, one with angel wings, another wearing a cape, and one clinging joyfully to a hut roof. A duck in a space suit waddles in. The iconic MoT flag flies, while a seagull soars overhead with a pencil in its beak. In the centre, a tester gently rocks a pram containing a test bug. Postman’s sponsorship arrives with energy and collaboration, expanding the API testing frontier in the MoTaverse. Keywords: Postman sponsor, TestBash Brighton 2025, Ministry of Testing, MoTaverse, API testing, software testing community, bug characters, quality engineering event.
We’re thrilled to welcome Postman as a Gold Sponsor for TestBash Brighton 2025! A leader in API collaboration, Postman helps testers an...
TestBash Brighton 2025 session: “A day in the life of a Quality Lead” by Elizabeth Zagroba. This talk makes the behind-the-scenes work of a quality lead visible, helping attendees understand how to advocate for their work, interact with leadership, and see what quality looks like beyond their own team. Keywords: Elizabeth Zagroba, quality leadership, day in the life QA, TestBash Brighton 2025, software testing careers, QA visibility, Ministry of Testing, quality roles, work transparency.
Elizabeth Zagroba shares what it really looks like to be a Quality Lead. See the work behind the title, how to make your impact visible...
Take software development practices with a pinch of salt image
A reminder that one size does not fit all and that time moves on
TestBash-2025-Brighton
About Me: I’m coming from: Bristol, UK My role is: QA Professional I’d love to meet others who are into: Passionate about Software testi...
Subscribe to our newsletter
We'll keep you up to date on all the testing trends.