Explore all our content

Dive into our back catalogue of content. With over 2000 pieces of content to digest, there will definitely be some software testing content of interest to you.

Displaying contents 2251 - 2280 of 6148 in total


Four people represented in a simple cartoonish way collaborating around a shared board, learning together
You've heard the cliché, it's not what you know, it's who you know. When people see you as a leader and look to you for answers, you can'...
A group of five smiling women stands behind a large, rustic wooden structure at night. The women are looking towards the camera. The woman on the far right rests her hand on the wooden structure. A duck, a seagull and a bug character have been added to the image.
Can you find Bug, Space Duck and Space Seagull?
TestBash 2025 profile for Gary Hawkes, including an image
About me: I'm coming from: Bury St Edmunds, Suffolk, UK My role is: QA Lead I'd love to meet others who are into: Leadership and anyone i...
A photo of me (Tom Game). I am a white, middle-aged man with a bald head and beard. I am wearing glasses and a black t-shirt with a blue logo that says "made in the 90s".
About Me: I’m coming from: Cambridge, UK My role is: Quality and Test Engineer I’d love to meet others who are into: Testing AI systems |...
A screenshot from the paper: The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Showing a composite figure illustrating how reasoning models solve the Tower of Hanoi problem, and how their performance varies with problem complexity.

Top Section – LLM Response Workflow:

On the left is a code-like LLM response showing a thinking section with a list of disk moves (e.g., [1, 0, 2], [2, 0, 1], etc.) and an answer section referencing the final moves list. Arrows indicate:

Moves are extracted from the thinking section for analysis.

Final answer is extracted from the answer section for measuring accuracy.

To the right, a sequence of three Tower of Hanoi diagrams represents:

Initial State: All disks stacked on peg 0.

Middle State: Disks distributed across pegs.

Target State: All disks correctly stacked on peg 2.
Each disk is color-coded and numbered for clarity.

Bottom Row – Three Line Graphs:

Left Graph: Accuracy vs. Complexity

Y-axis: Accuracy (%)

X-axis: Problem complexity (number of disks, from 1 to 20)

Two lines: Claude 3.7 (red circles) and Claude 3.7 with “thinking” mode (blue triangles).

Accuracy drops sharply for both as disk number increases, with “thinking” performing slightly better up to 8 disks.

Middle Graph: Response Length vs. Complexity

Y-axis: Token count

X-axis: Number of disks

“Thinking” responses grow rapidly in length with complexity, peaking near 8 disks.

Right Graph: Position of Error in Thought Process

Y-axis: Normalized position in the LLM’s reasoning (0 to 1)

X-axis: Complexity (1 to 15 disks)

Shows where correct vs. incorrect reasoning paths diverge; incorrect solutions typically fail earlier in the thoughts.

Background colors across all graphs denote complexity bands: yellow (easy), blue (moderate), red (hard).
Apple just tested the smartest "reasoning" AI Models out there: Claude 3.7 Sonnet, DeepSeek-R1, OpenAI’s o1/o3. The verdict? They didn’t...
Thumbs up from Rahul Parwal
A few things I’m proud of: 🧪 I break things for a living (so they don’t break in the wild). 📍 Based in Jaipur, India’s Pink City, where ...
A white woman in their 40s with shoulder-length dark blonde hair, wearing a light blue jeans jacket.
Nataliia Burmei
About Me: I’m coming from: Surrey/London, England My role is: Lead Quality Engineer I’d love to meet others who are into: quality engi...
Follow you, follow me, quality engineering community: MoT Weekly – Issue 520 image
  • Simon Tomes's profile
What impact can following people in the MoTaverse have on your career and those around you? Read a roundup of news, events and actionable ideas in this week's MoT Weekly.
A man standing in a bookstore wearing a black t-shirt with white writing: test shirt please ignore. His face is not included in the original photo to protect his identity.
You can imagine my delight when I spotted this bloke in my local Waterstones bookstore. "I'm a tester, can I get a photo of your t-shirt...
Take software development practices with a pinch of salt image
A reminder that one size does not fit all and that time moves on
TestBash-2025-Brighton
About Me: I’m coming from: Bristol, UK My role is: QA Professional I’d love to meet others who are into: Passionate about Software testi...
The image shows three pencils on a white background, with the caption "Empower others".
Nataliia Burmei
I am a big believer that confidence is built on actively doing things. Particularly, doing things you have never done before, there is not much...
selfie of blond woman with colourful nails in front of a window with a view of sky and city
About Me: I’m coming from: Brighton, UK My role is: QA Lead in a small web agency I'm coming to TestBash 2025 as a: Ambassador and Atten...
Testing inside the boxes and beyond Ep 93 image
Testing meetups, community memories, and finding your voice — the This Week in Testing crew shares insights, inspiration, and reflections from around the globe.
I am a white woman with long straight light brown hair. Sat in my home office with a blurred background, wearing a green dress, I speak directly to the camera and showcase a variety of books which are listed in the description.
Hello, my name is Emily! I’m travelling to TestBash from Leeds (UK) and I’d love to meet and form stronger relationships with others who...
Crowdcast Android app during This Week in Testing episode.

Speakers Simon, Demi and Preeti are shown at the top of the screen 'on stage'.

Below, there is a live chat where Eamon asks the question, 'Can someone explain grey box testing please?'

Aj and Oleksandr provide responses to this question.
A discussion on TWIT had me wondering what 'Grey Box Testing' was. Luckily, some lovely people were on hand to answer my question!