[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: occasional kernel freezes possibly related to aac(4) 2410SA
This is most likely a firmware hang that causes the aac timed out
messages. Yes, it is a good idea to contact adaptec and tell them that
they have firmware issues. The more people that contact them the
better.
Good luck,
/marco
On Jan 8, 2005, at 10:46 AM, Ingo Schwarze wrote:
> Hi Marco Peereboom, hi Jim Razmus, hi misc,
>
> last Xmas, i reported occasional problems with our i386 based
> nfs file server using an Adaptec 2410SA SATA RAID controller, see
>
> Message-ID: <20041223233510.GA5131@athene.usta.de>
> Message-ID: <20041224042231.GB28407@mail.bonetruck.org>
> Message-ID: <ED62BEBC-55D2-11D9-8DF9-000A95908CA4@peereboom.us>
> Message-ID: <20041227194636.GA5438@athene.usta.de>
>
> With controller firmware 4.0-0 (factory default), i had three
> or four freezes during two weeks in december. With controller
> firmware 4.1-0 (=Build 7244), i had half a dozen freezes
> during a few hours in a single day (Dec 26). With controller
> firmware 4.2-0 (=Build 7348), i now have one freeze after two
> weeks: The machine has been working from Dec 27 until Jan 08;
> today, it died once again. Its last words were, as usual:
>
> sd0(aac0:0:0): timed out
>
> My conclusion is:
> a) The freezes are very probably in some way or other related
> to the controller since their frequency varies with the
> firmware version in use, and
> b) 4.2-0 is better than 4.0-0 is better than 4.1-0,
> but 4.2-0 still does not seem to be perfect.
>
> I had already gone back to the GENERIC kernel after one week
> or so of stable operation, so once more i cannot tell which
> SCSI command was the last one before the hang. I will now
> once more boot my "AAC_DEBUG=0x0C"-kernel and wait for the
> next incident.
>
> Please tell me when anybody is working on aac(4) or when
> i can help with any testing.
>
> Yours,
> Ingo
>
> P.S. off-topic:
> Btw, i'm pondering whether i should contact Adaptec asking
> for better firmware or better hardware - the collected evidence
> that this is rather due to the controller itself or its
> firmware than to the rest of my box and the OS is not that
> bad after all, isn't it? There's no new info on the Adaptec
> homepage yet, i just checked once more...