[GTALUG] war story: read you kernel log (dmesg) once in a while

ac ac at main.me
Sat Mar 20 04:08:43 EDT 2021


Hi,

Thank you for your war story, for me it was very useful as it relates to SSD :)

On Fri, 19 Mar 2021 11:24:11 -0400 (EDT)
"D. Hugh Redelmeier via talk" <talk at gtalug.org> wrote:

> [One reason for this message is to test if the mailing list is
> working.  I haven't seen a new message in a 10 days.]
> 
> dmseg command
> =============
> 
> The dmesg command shows you the kernel log.  It takes the log from
> the kernel itself.  It is stored in a circular RAM buffer, so you can
> still read it if the normal logging system isn't working.  This
> buffer is a fixed size so older messages can get pushed out by newer
> ones if there is enough logging going on.
> 
> You can get more info on Fedora by
> 	journalctl -b
> but it isn't limited to kernel messages.  It does colour-code
> messages based on severity, so that's a nice plus.  Since this log
> typically goes to disk, it tends to be complete.  Oh: the -b flag
> means: start from the most recent boot -- logs can go back months and
> years.
> 
> As an old timer, my first instinct is to use dmesg.
> 
> looking at kernel messages
> ==========================
> 
> dmesg | less -i
> 
> dmesg pours out a lot of lines.  less is a good way of navigating
> this log.  The -i makes searches within less case-insensitive.
> 
> The Linux kernel is meant to log problems and move on.  This means
> that there can be problems that you don't even know about because all
> looks well.  I think it pays to once in a while look for problems
> reported in the log.
> 
> A lot of messages will be inscrutable.  If they intrigue you,
> investigate them.
> 
> Here's one I noticed recently on one of my systems.  It's been there
> for the whole life of the system, but I never noticed.  
> 
> [    2.545003] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
> [    2.545482] ata1.00: supports DRM functions and may not be fully
> accessible [    2.545554] ata1.00: READ LOG DMA EXT failed, trying
> unqueued [    2.548363] ata1.00: disabling queued TRIM support
> [    2.548370] ata1.00: ATA-9: Crucial_CT240M500SSD1, MU05, max
> UDMA/133 [    2.548376] ata1.00: 468862128 sectors, multi 16: LBA48
> NCQ (depth 31/32), AA [    2.551973] ata1.00: supports DRM functions
> and may not be fully accessible [    2.554812] ata1.00: disabling
> queued TRIM support [    2.558006] ata1.00: configured for UDMA/133
> 
> Look at "disabling queued TRIM support".
> What's that about?
> 
> - TRIM is a useful feature in SSDs.  It allows the OS to advise the
> SSD that chunks of the filesystem are no longer needed (eg. deleted
> files). This helps the SSD's wear-levelling firmware's garbage
> collector.  It should help speed up the SSD and add to its lifetime.
> 
> - Even without queued TRIM, TRIM can still be accomplished.  Queued
> TRIM is some higher-performance variant (I haven't investigate).
> 
> But what's up with this message?  Googling got me to
> 	<https://bugzilla.kernel.org/show_bug.cgi?id=71371>
> Very useful.
> 
> - apparently my SSD (a Crucial M500) had buggy firmware, leading to 
>   corruption in some cases.  Including queue TRIM
> 
> - Crucial released new firmware (in 2015)
> 	<https://www.crucial.com/support/ssd-support/m500-support>
> 
> - my SSD has this firmware (MU05), as reported in the dmesg output
> 
> - even after the update, M500's screw up queued TRIM
> 
> - the Linux kernel embeds all this wisdom and it blacklists queued
> TRIM on my box
> 
> I spent an hour investigating this.  There was no effect, except that
> I learned a few things.  Linux just does the right thing.
> 
> I do recommend also looking at journalctl output because it
> highlights things that it thinks are of particular interest.
> ---
> Post to this mailing list talk at gtalug.org
> Unsubscribe from this mailing list
> https://gtalug.org/mailman/listinfo/talk



More information about the talk mailing list