VM Hangs – Azure, AWS, GCP – How to analyse why?

Get data

  1. Connect to Serial Console in Azure.
  2. type cmd.
  3. press escape – TAB – you should reach page when you can enter the username and password.
  4. try to authenticate and now you have access to the operating system, so you can do the analyses.
  5. if it is not possible, return to Serial Console (escape TAB).
  6. Type crashdump and press enter. This generate memory dump that can be analised.
  7. Restart machine and somehow download c:\windows\memory.dmp

Install Window Debuger . The best is to use Windows 11 for this.

Launch WinDBG and open the downloaded memory.dmp. You do not need to add symbols server – just in the latest version it will be picked automatically.

Execute the follwoing:

  1. !analyze -v – in this scenario it tells you nothing just you intentianally invoke the system crash. If System restart automatically it could help in analises (espacially for blue screens).
  2. !process 0 0 – show all processes.
  3. !process 0 7 – look for any with unusual CPU time or state.
  4. !locks – Shows kernel locks and potential deadlocks.
  5. !irql – Check interrupt request levels.
  6. !ready – Shows threads ready to run (may indicate scheduling issues).
  7. !vm – Virtual memory usage.
  8. !poolused – Kernel pool usage – look for pool exhaustion.
  9. !verifier – Driver verifier status.
  10. !devnode 0 1 – Device tree – look for failed devices.
  11. lm t n – List loaded modules/drivers with timestamps.
  12. !drvobj – Examine specific driver objects if you see suspicious ones.
  13. !thread – Examine the current thread context.
  14. !qlocks – Queue locks.
  15. !wdfkd.wdflogdump – Windows Driver Framework logs (if applicable.
  16. !poolused 2 – This shows pool usage sorted by size (largest first) – helps identify the biggest memory consumers.
  17. !deadlock – Looks for deadlock detection information.

You can put the results for any AI tool for the analyses.

My last analyses ends with message:

******* 266688 kernel stack PTE allocations have failed ******
******* 365226304 kernel stack growth attempts have failed ******