bit rot

drag

Elite Member
Jul 4, 2002
8,708
0
0
One thing I have always been wondering about since I've started downloading all my OS's over the internet is what is the rate of destroyed or corrupted bits in a 650 meg download? I know that tcp/ip protocal allows to error correction and it resends packets that do not meet the pariaty bits.. But that can't stop everything. How much does digital information degrade from being compiled on someone's computer copy/moved, tar'd, zip'd, uploaded, downloaded , compressed on a cd, tested, dd'd into a iso file, uploaded to ftp, downloaded thru a hundred or so assorted routers and nodes of different speeds and technology, copied to my thru my network into my computer onto my harddrive, then burned to a cd, handled, get pop spilled on it, cleaned, get drop kicked by the cat, and finally installed on a freshly formated harddrive? Why the hell does this even work at all?

And how well does a modern harddrive store information, at what rate does the bits degrade on the surface of the platters? How long can you hope to store information reliably on a harddrive, or tape medium, or a profesional cd/dvd, a burned one?

What I don't understand is when a program gets some of it's bits radomly shuffled, why does it still work ok?

What are the weaknesses of checksumming? If one bit goes from 0 to 1 and another goes from 0 to 1, don't these cancel each others out. So am I right in thinking that we just realy on the law of statistics to gaurd our information? (Which is why I suppose backups are a nessessity)
 

AndyHui

Administrator Emeritus<br>Elite Member<br>AT FAQ M
Oct 9, 1999
13,141
16
81
You should check out Cyclic Redundancy Checking. CRCs are actually very good.
 

Shalmanese

Platinum Member
Sep 29, 2000
2,157
0
0
Hmm... CD's die all the time, its part of the cycle of life. Hard drives, you dont expect to die too often. And when they do, its usually pretty obvious as the entire HD becomes useless. Error checking sums are actually very good. Especially with dedicated data lines instead of dirty analog lines.
 

CTho9305

Elite Member
Jul 26, 2000
9,214
1
81
Originally posted by: dejitaru
So how many bytes are corrupted? One in a million?

Before or after correction? After correction, WAY fewer, or CD ISOs would hardly work at all.

edit: MD5 sums do much more than count 1s and 0s. In fact, it isn't really possible to come up with 2 different things that have the same MD5 sum in a given amount of time. If the MD5 of your file matches the MD5 of the original, you can be sure it is the right file, with no errors.
 

CTho9305

Elite Member
Jul 26, 2000
9,214
1
81
Originally posted by: dejitaru
What is MD5? If MD5 is so great, why not just restore the entire file from the MD5?

You can't go backwards with MD5. It is a one-way hash... that is why it is often used for password systems. In a unix environment, your password isn't actually stored anywhere. What is stored is the MD5 of your password. When you try to log in, it hashes your what you typed, and compares it to the hash that is stored. If it matches, then you got the password right.

The idea is that you can't generate two files with the same MD5, but not that no such files exist. Obviously for files longer than the MD5 size, more than one will result in the same hash.
 

Shalmanese

Platinum Member
Sep 29, 2000
2,157
0
0
The thing is that you need to have a REALLY corrupted file in order to get the same MD5 checksum. Were not talking about 1 or 2 bits so the actual chance of getting a corrupt file with the same MD5 checksum is incredibly remote. However, a MD5 checksum is only 128 bits so you cant re-extract the information out of it .
 

glugglug

Diamond Member
Jun 9, 2002
5,340
1
81
TCP checksum is 16-bit, not 32 like you would have in most archives.

So the chance of a corrupt packet going unnoticed is 1 in 65536.

So if something was REALLY bad on your network and literally 50% of your packets were getting corrupted and needing a retransmit on average this would make it through the TCP layer every 100MB or so using normal 1.5K packets. Actually the odds are better than this because the IP layer has its own checksum that will catch some of the errors the TCP checksum doesn't, there is error detection built into every modem since 9600bps (and even a some 2400bps modems have v.42), and chances are your PPP connection (or PPTP between the cable/DSL modem & ISP for broadband) has a 32-bit checksum on each packet, which if your connection was really bad and getting 50% errors would let one error through every 6 terabytes or so.
 

MrDudeMan

Lifer
Jan 15, 2001
15,069
94
91
Originally posted by: glugglug
TCP checksum is 16-bit, not 32 like you would have in most archives.

So the chance of a corrupt packet going unnoticed is 1 in 65536.

So if something was REALLY bad on your network and literally 50% of your packets were getting corrupted and needing a retransmit on average this would make it through the TCP layer every 100MB or so using normal 1.5K packets. Actually the odds are better than this because the IP layer has its own checksum that will catch some of the errors the TCP checksum doesn't, there is error detection built into every modem since 9600bps (and even a some 2400bps modems have v.42), and chances are your PPP connection (or PPTP between the cable/DSL modem & ISP for broadband) has a 32-bit checksum on each packet, which if your connection was really bad and getting 50% errors would let one error through every 6 terabytes or so.

those numbers sound a little bogus. 1 bit every 6TBs? if that is true...wow
 

dzt

Member
Jan 22, 2003
76
0
0
Originally posted by: MrDudeMan
Originally posted by: glugglug
TCP checksum is 16-bit, not 32 like you would have in most archives.

So the chance of a corrupt packet going unnoticed is 1 in 65536.

So if something was REALLY bad on your network and literally 50% of your packets were getting corrupted and needing a retransmit on average this would make it through the TCP layer every 100MB or so using normal 1.5K packets. Actually the odds are better than this because the IP layer has its own checksum that will catch some of the errors the TCP checksum doesn't, there is error detection built into every modem since 9600bps (and even a some 2400bps modems have v.42), and chances are your PPP connection (or PPTP between the cable/DSL modem & ISP for broadband) has a 32-bit checksum on each packet, which if your connection was really bad and getting 50% errors would let one error through every 6 terabytes or so.

those numbers sound a little bogus. 1 bit every 6TBs? if that is true...wow

for one statistical freak every 6TB, I'll give you a bigg "WOW"



computer goes GIGO while people wanna LO
 

drag

Elite Member
Jul 4, 2002
8,708
0
0
ok, so the checksumming in tcp is pretty rocken, so as long as you check your downloads with the md5 (asuming you have a checksum on a secure server to check it too) you'll be pretty safe thru the internet...

But how about your HD in constant use? How much bit drift is going to happen in a binary file that is accessed 30 or 40 times a day for 3 years? I know the magnet head has very little effect on static content, but is that cumulative? Could this be the cause of having to reformat your drive periodicly, or is that just a build up of user errors and bugs in programs slowly corrupting themselves...

We all heard of the ignorant computer user in a office setting using the side of ther computer to hold up various pictures of there children with refrigurator magnets...
 

dejitaru

Banned
Sep 29, 2002
627
0
0
It isn't the HD, but your OS screws itself, damaging system files, corrupting data. That's why it's good to backup.
 

MrDudeMan

Lifer
Jan 15, 2001
15,069
94
91
Originally posted by: dzt
Originally posted by: MrDudeMan
Originally posted by: glugglug
TCP checksum is 16-bit, not 32 like you would have in most archives.

So the chance of a corrupt packet going unnoticed is 1 in 65536.

So if something was REALLY bad on your network and literally 50% of your packets were getting corrupted and needing a retransmit on average this would make it through the TCP layer every 100MB or so using normal 1.5K packets. Actually the odds are better than this because the IP layer has its own checksum that will catch some of the errors the TCP checksum doesn't, there is error detection built into every modem since 9600bps (and even a some 2400bps modems have v.42), and chances are your PPP connection (or PPTP between the cable/DSL modem & ISP for broadband) has a 32-bit checksum on each packet, which if your connection was really bad and getting 50% errors would let one error through every 6 terabytes or so.

those numbers sound a little bogus. 1 bit every 6TBs? if that is true...wow

for one statistical freak every 6TB, I'll give you a bigg "WOW"



computer goes GIGO while people wanna LO

wtf did that mean? can someone translate? learn to speak english, and if you are going to speak slang, at least dont be a moron about it.
 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |