Sign in to follow this  
Followers 0
anywhere

crazy RIDICLIOUS head fly hours.....

5 posts in this topic

alright, i'm more concerned of the head flying hours. mind you the numbers are changing wildly in the billions mark, as fast as i issue an a smartctl -a command...wtf?!?!??!?!

one second, 200 some trillion, next second 400 some trillion...

and today i went to set my energy drink on the encloure, it almost seems like an acoustic ding makes one of the two drives click...so for kicks i took my nail and flicked the side of the case, drive clicks....?????

i'm worried, these are raid5, but mdadm on creation of raid5 recongnizes them as mirrored, and will do parity build upon adding a 3rd drive. so right now if one pukes, i'm still safe, but, still, this is serious WTF material for such new drives with low hours. and they're lightly accessed....400kb-1Mb on local LAN for movie playing, they basically spin without any serious load, other then the occassional first sunday of the month check (mirror checking until 3rd drive, then parity checking)

i have 2 more ordered, will be here the 14th. i plan to make a 2nd array ( and properly sector align the partitions this time), duplicate everything,, destroy the 1st, triple check the first two drives, add a 3rd drive, let it resync, use resize2fs to grow the new MD size, then add the 4th, grow to raid6, ....and hope for the best. heh. this is the my first step, of expanding my home media server. 6tb at a time.

obvisouly i started small, and mdadm has fabulous features of expanding without headache, along side of ext4. thank god i picked this method, i'm emptying my 2Tb seed box maybe once every 10 weeks.

anyways, what you guys think of these wild smart reads??

i included some important system information. it's a 2.8ghz when called

root@razorblade:/home/jason# uname -a
Linux razorblade 3.2.0-4-686-pae #1 SMP Debian 3.2.46-1 i686 GNU/Linux

root@razorblade:/home/jason# cat /proc/cpuinfo
processor : 0
vendor_id : AuthenticAMD
cpu family : 15
model : 95
model name : AMD Athlon 64 Processor 3800+
stepping : 2
microcode : 0x62
cpu MHz : 1000.000
cache size : 512 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 1
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 syscall nx mmxext fxsr_opt rdtscp lm 3dnowext 3dnow up extd_apicid pni cx16 lahf_lm svm extapic cr8_legacy
bogomips : 2009.08
clflush size : 64
cache_alignment : 64
address sizes : 40 bits physical, 48 bits virtual
power management: ts fid vid ttp tm stc

root@razorblade:/home/jason# cat /proc/meminfo
MemTotal: 448204 kB
MemFree: 248404 kB
Buffers: 14676 kB
Cached: 133568 kB
SwapCached: 0 kB
Active: 66944 kB
Inactive: 95628 kB
Active(anon): 14796 kB
Inactive(anon): 208 kB
Active(file): 52148 kB
Inactive(file): 95420 kB
Unevictable: 0 kB
Mlocked: 0 kB
HighTotal: 0 kB
HighFree: 0 kB
LowTotal: 448204 kB
LowFree: 248404 kB
SwapTotal: 0 kB
SwapFree: 0 kB
Dirty: 0 kB
Writeback: 0 kB
AnonPages: 14328 kB
Mapped: 9912 kB
Shmem: 688 kB
Slab: 29140 kB
SReclaimable: 21252 kB
SUnreclaim: 7888 kB
KernelStack: 1192 kB
PageTables: 868 kB
NFS_Unstable: 0 kB
Bounce: 0 kB
WritebackTmp: 0 kB
CommitLimit: 224100 kB
Committed_AS: 156044 kB
VmallocTotal: 576632 kB
VmallocUsed: 31864 kB
VmallocChunk: 544244 kB
HardwareCorrupted: 0 kB
AnonHugePages: 0 kB
HugePages_Total: 0
HugePages_Free: 0
HugePages_Rsvd: 0
HugePages_Surp: 0
Hugepagesize: 2048 kB
DirectMap4k: 41856 kB
DirectMap2M: 415744 kB

root@razorblade:/home/jason# cat /root/sda
smartctl 5.41 2011-06-09 r3365 [i686-linux-3.2.0-4-686-pae] (local build)
Copyright © 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Device Model: ST3000DM001-1CH166
Serial Number: W1F42CEQ
LU WWN Device Id: 5 000c50 06e302226
Firmware Version: CC27
User Capacity: 3,000,592,982,016 bytes [3.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 9
ATA Standard is: Not recognized. Minor revision code: 0x001f
Local Time is: Tue Mar 11 16:04:09 2014 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 584) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 2) minutes.
SCT capabilities: (0x3085) SCT Status supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 108 099 006 Pre-fail Always - 16622608
3 Spin_Up_Time 0x0003 095 093 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 41
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 061 060 030 Pre-fail Always - 17185157530
9 Power_On_Hours 0x0032 098 098 000 Old_age Always - 1870
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 41
183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 072 066 045 Old_age Always - 28 (Min/Max 22/31)
191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 33
193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 65
194 Temperature_Celsius 0x0022 028 040 000 Old_age Always - 28 (0 17 0 0)
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 34497177323342
241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 13822404900
242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 45934373780

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 1316 -
# 2 Short offline Completed without error 00% 1311 -

SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

and for sdb

root@razorblade:/home/jason# cat /root/sdb
smartctl 5.41 2011-06-09 r3365 [i686-linux-3.2.0-4-686-pae] (local build)
Copyright © 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Device Model: ST3000DM001-1CH166
Serial Number: W1F440Q7
LU WWN Device Id: 5 000c50 06e31d5e3
Firmware Version: CC27
User Capacity: 3,000,592,982,016 bytes [3.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 9
ATA Standard is: Not recognized. Minor revision code: 0x001f
Local Time is: Tue Mar 11 16:04:16 2014 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 584) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 2) minutes.
SCT capabilities: (0x3085) SCT Status supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 108 099 006 Pre-fail Always - 15164504
3 Spin_Up_Time 0x0003 095 093 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 41
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 058 058 030 Pre-fail Always - 30070347153
9 Power_On_Hours 0x0032 098 098 000 Old_age Always - 1870
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 41
183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 071 066 045 Old_age Always - 29 (Min/Max 22/32)
191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 33
193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 66
194 Temperature_Celsius 0x0022 029 040 000 Old_age Always - 29 (0 17 0 0)
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 9345848837966
241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 7833754512
242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 51393562395

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 1316 -
# 2 Short offline Completed without error 00% 1311 -

SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay

2 people like this

Share this post


Link to post
Share on other sites

So I guess I'll trim so it fits better.

These measured in hex? Decimal? Hours? Secinds?

240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       34497177323342

240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       9345848837966

 

 

 

 

Sent from my rooted HTC Supersonic using Tapatalk 2 Pro

Share this post


Link to post
Share on other sites

It will likely need to be normalized, but doing that may difficult if there isn't a formula already. Values depends on the FW, how the manufacturers sets them, issues with the program/interface, which can make it hard

It does not seem to a be a critical value, so I would not worry about. Looks like overall, SMART passed which is good. There are critical values that you can look into to.

  • Read Error Rate [stores data related to the rate of hardware read errors that occurred when reading data from a disk surface]
  • Reallocated Sector Count [When the hard drive finds a read/write/verification error, it marks this sector as "reallocated" and transfers data to a special reserved area (spare area).]
  • Spin Retry Count [stores a total count of the spin start attempts to reach the fully operational speed.]
  • End to End Error [after transferring through the cache RAM data buffer the parity data between the host and the hard drive did not match]
  • Command Timeout [The count of aborted operations due to HDD timeout]
  • Reallocation Event Count [Count of sector remap operations]
  • Current Pending Sector Count [Count of "unstable" sectors (waiting to be remapped, because of read errors). ]
  • Uncorrectable Sector Count [The total count of uncorrectable errors when reading/writing a sector.]
  • Soft Read Error Rate [Count of off-track errors.]

http://www.easis.com/smart-value-interpretation.html

Clicking can be normal in some situations...some drives are known to click and it could be related to normal seek operations, head parking or power management features.

My suggestions:

1. Keep making backups/using RAID

2. Check SMART again with SeaTools

3. Check for FW(Seatools might have that option) else look for the latest FW version. It may help with the clicking as I think the ST3000DM001 was a model affected by it.

Share this post


Link to post
Share on other sites

I'm on linux. So not sure of Seagate tools unless I move drives to another box/install.

And I just finished growing the array with 2 more of the same identical drives, took 60hrs.

Smart shirt and long passed on the 2 new, and again on the older droves.

The click I can make happen every time I flick the case. Not even the drive itself. I'm not going to experiment anymore until the reshape and filesystem grow has complete... I don't have any backups. Nothing important, just too lazy to download/rip everything again.

Hope I didn't invest in flukes.

Sent from my rooted HTC Supersonic using Tapatalk 2 Pro

Share this post


Link to post
Share on other sites

Haha, I found some Seagate software for Linux. The 230KB program doesn't test sata/Pata/eide.

But the 16MB version on windows does.

Garbage company.. .

Oh well.

Sent from my rooted HTC Supersonic using Tapatalk 2 Pro

Share this post


Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!


Register a new account

Sign in

Already have an account? Sign in here.


Sign In Now
Sign in to follow this  
Followers 0