- Jan 14, 2007
- 5
- 0
- 0
Here is the situation that I am facing. I have a large data file, about 50G to 100G, that is to be read from disk storage repeatedly by a Fortran program. I am looking at the possibility of using RAID 0, say 4 SATA disks, as a way of speeding up the reading of the data. Here are some questions that I would appreciate any input and discussions:
1. Sould I use the SATA slots on the motherboard or it would be better to use a controller card for RAID?
2. I am using linux as the operating system. What is the best file system to be used on the RAID disk for fast data access?
3. If I use mdadm to creat the array, is there an optimal value for chunk size for reading primarily large data files?
4. What realistic performance increase that I can expect from RAID 0 configuration? Would it be N*V where N the number of disks and V the read speed of single drive?
5. What are the things to look for when buying a motherboard or RAID controller for this purpose?
6. Any recommendations on particular motherboard/harddrive combinations from experiences?
A little test that I have experimented with the motherboard that I currently have is as follows. The board is P4P800-E, it has two SATA 1.5g/s connections. I have two identical Seagate 320MB .10 SATA 3g/s drives. The system is running on Mandrake 10.1. The drive is only recognized when the on board raid controller is turned on. The drives are recongnized as sda and sdb and hdparm returns 72MB/s on each device. I created RAID level 0 and the hdparm -t /dev/md0 returned only 83MB/s. The single drive is not giving 150MB/s as it should (right, for 1.5g/s?) and the raid 0 is not doubling the read speed.
I would appreciate any comments and suggestions.
Thanks!
/Hugoist