CSV files and their uses
If you collect images, you will hear the term "CSV file" often. So what exactly is a CSV file? What is it used for, and how do you use it? Read on and you should come away with a basic understanding of CSV.
CSV stands for Comma Separated Values, and a CSV file is usually a plain text file. Below is part of a real CSV file to give you a feel for it. We chose the CSV file of Sjojo_Rescan (Sjojo is a member of ASW-Asia Scanning Style).
sj_mino1001.jpg,715282,4FB55FE8,
sj_mino1002.jpg,471289,93203C5C,
sj_mino1003.jpg,451929,C4E80467,
Usually, no blank space is left at the start of a CSV file. Each line records the data for one picture, and the fields on a line are separated by commas (standard English commas). Generally speaking, the .CSV files used for collecting pictures follow this format:
File name, file size (in bytes), CRC check value, comment (can be omitted)
sj_mino1001.jpg,715282,4FB55FE8,
|               |      |        |
File name       Size   CRC      Comment (omitted)
If Microsoft Excel is installed on your machine, .csv files will open in Excel by default. Note that when you double-click a .CSV file and Excel opens it, Excel will often ask on closing whether you want to save it in the correct file format, even if you made no changes. Be sure to choose "No". Excel assumes that the numbers in the .CSV file should be written in scientific notation, whereas we need them as ordinary numbers. If you choose "Yes", Excel may rewrite the numbers in the CSV file in scientific notation (in the form 2.54932E+5), and the file will be unusable. So if you mostly use CSV files for collecting, it is recommended to change the default program for .CSV to a text editor. The system's built-in Notepad is a good choice.
Now back to the format of the CSV file. In the line shown above, reading from left to right, sj_mino1001.jpg is the file name and 715282 is the file size in bytes. Because commas have a special role in CSV files, a file name that itself contains a comma must be enclosed in quotation marks to avoid ambiguity, for example "The Art, Fantasy.jpg",384211,...,.... After the size comes an 8-digit hexadecimal number, 4FB55FE8, which is the CRC32 check value of the file. This is the heart of the whole CSV file.
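The field layout and the quoting rule described above can be sketched in a few lines of Python. This is only an illustration using the standard csv module, not what the collecting software itself does; the function name parse_collection_csv and the second sample line (its name, size, CRC, and comment) are made up for the example.

```python
import csv
import io

# The first line is from the sample above; the second is a made-up entry
# (hypothetical name, size, CRC, and comment) to show a quoted file name.
SAMPLE = (
    'sj_mino1001.jpg,715282,4FB55FE8,\n'
    '"hypothetical, name.jpg",1234,0000ABCD,made-up entry\n'
)


def parse_collection_csv(text):
    """Parse 'name,size,crc,comment' lines into a list of dicts."""
    entries = []
    for row in csv.reader(io.StringIO(text)):
        if len(row) < 3:
            continue  # skip blank or incomplete lines
        entries.append({
            "name": row[0],
            "size": int(row[1]),
            "crc": row[2].strip().upper(),
            "comment": row[3] if len(row) > 3 else "",
        })
    return entries


for entry in parse_collection_csv(SAMPLE):
    print(entry)
```

The csv module handles the quotation marks, so the comma inside the quoted file name is kept as part of the name rather than treated as a field separator.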
A small example shows why it matters. Sjojo released a picture, sj_mino1001.jpg. Suppose you got this picture from a friend. You will often wonder: has it been altered along the way? Was it accidentally damaged? CRC exists to answer this question. Normally, every time a scanner releases a set of pictures, a CSV file is released with it (this CSV is usually called the Official CSV, meaning the officially released CSV). The CRC values in this CSV are produced by special software that runs the CRC32 algorithm over each file (CRC16 is another commonly used CRC algorithm). The resulting value serves as a mark of the file's authenticity. In most cases, if the file's size or content was modified in transit, running the same CRC32 algorithm over it yields a different check value. If the CRC values match, there is every reason to believe the file is the original and has not been altered.
Sometimes the file name of a picture you receive has been changed, so how do you tell which is which? Here the CRC value comes into play again: special software computes the file size and CRC value of the image, then searches the CSV for an entry with the same size and CRC. If one is found, the image is renamed to the name in the CSV. For example, you receive a picture named pic0001.jpg. You know it belongs to the Sjojo_Rescan collection, but not which picture it is. The software computes its size and CRC as 715282 and 4FB55FE8, then finds the line sj_mino1001.jpg,715282,4FB55FE8 in the CSV file. Size and CRC both match, so the software concludes that the original name is sj_mino1001.jpg and automatically renames the file to sj_mino1001.jpg. What gets changed is the file name of the image, not the CSV file.
The last field holds notes and comments and can be omitted. Note that there is no comma after the comment. If you omit the comment, you must still leave a comma after the CRC value, otherwise the software will treat the CRC value as the comment.
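The size-and-CRC lookup described above can be sketched roughly as follows. This is a minimal illustration, not the method any particular tool uses: it assumes the CSV values are standard CRC32 as computed by Python's zlib.crc32, works on bytes in memory instead of real image files, and the names crc32_hex, find_original_name, and sample_original.bin are invented for the example.

```python
import zlib


def crc32_hex(data: bytes) -> str:
    """Return the CRC32 of data as 8 uppercase hex digits."""
    return format(zlib.crc32(data) & 0xFFFFFFFF, "08X")


def find_original_name(data: bytes, entries):
    """Look up (size, CRC32) of data among (name, size, crc) entries.

    Returns the name recorded in the CSV, or None if nothing matches.
    """
    size = len(data)
    crc = crc32_hex(data)
    for name, entry_size, entry_crc in entries:
        if entry_size == size and entry_crc.upper() == crc:
            return name
    return None


# Stand-in data: b"123456789" is the usual CRC32 test string (CBF43926).
# The entry name is hypothetical; a real tool would read the image bytes.
entries = [("sample_original.bin", 9, "CBF43926")]

print(find_original_name(b"123456789", entries))  # sample_original.bin
print(find_original_name(b"123456780", entries))  # None: content changed
```

A tool working on disk would read each file's bytes, run the same lookup, and rename the file (for example with os.rename) when a match is found.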
After reading all of the above, you should have not only a general understanding of CSV files but also a rough idea of what they are for. We collect pictures through many different channels: from websites, from IRC, or simply from CDs sent by friends. Along the way, pictures may suffer all kinds of rough treatment, especially pictures collected from websites. Sometimes it is just a simple name change. More often, because many PLMM sites have limited space, the picture's dimensions or its file size are reduced. It is common for a 1356x588 image to be shrunk to 678x294, or for a 400-500K file to be reduced to a few dozen KB. Worse still, many websites remove the scanner's original logo and replace it with their own. There are also pranksters who modify pictures and turn them into new ones. None of this matters for casual viewing, but for a collector, receiving such pictures is intolerable. How to verify the authenticity of a picture is a problem faced by scanners and collectors alike, and CSV files appeared to solve it. CSV files were originally used for simple databases. Because the format is simple and open, image scanners began using them to catalogue their own picture sets.
As mentioned above, a CSV file is a plain text file in which each line records several attributes of one image. When you collect a picture set, as long as you can find its CSV file and check your files against it with special software, you will know exactly where the set stands: how many pictures it contains, how many you have received, which ones are originals, and which may have been altered or damaged. We can think of the .CSV file as an index: with it you can not only "search for the horse according to the picture", but also check whether the "horse" you have collected is of pure bloodline.
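Using the CSV as an index in this way could look roughly like the sketch below. It is an assumption-laden illustration, not the behaviour of any real checker: it assumes standard CRC32 via zlib.crc32, takes file contents as in-memory bytes, and the function name check_collection, the report categories, and all sample names and data are invented.

```python
import zlib


def check_collection(entries, files):
    """Compare local files against CSV entries.

    entries: list of (name, size, crc_hex) taken from the CSV.
    files:   dict mapping local file name -> file contents (bytes).
    """
    wanted = {(size, crc.upper()): name for name, size, crc in entries}
    report = {"ok": [], "renamed": [], "unmatched": [], "missing": []}
    found = set()
    for local_name, data in files.items():
        key = (len(data), format(zlib.crc32(data) & 0xFFFFFFFF, "08X"))
        original = wanted.get(key)
        if original is None:
            # Not in the CSV: altered, damaged, or from another set.
            report["unmatched"].append(local_name)
            continue
        found.add(original)
        if original == local_name:
            report["ok"].append(local_name)
        else:
            report["renamed"].append((local_name, original))
    report["missing"] = sorted(set(wanted.values()) - found)
    return report


# Hypothetical three-entry CSV and three local files (stand-in bytes).
entries = [("a.bin", 9, "CBF43926"), ("b.bin", 0, "00000000"),
           ("c.bin", 7, "FFFFFFFF")]
files = {"a.bin": b"123456789", "x.bin": b"", "junk.bin": b"hello"}

print(check_collection(entries, files))
```

In this made-up run, a.bin is verified, x.bin is recognised as a renamed copy of b.bin, junk.bin matches nothing in the CSV, and c.bin is still missing from the collection.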
CSV files usually come from two sources. The first is the CSV published by the scanner, generally called the Official CSV. Being the officially released CSV, it is generally quite authoritative. One peculiarity is that many scanners in Asia, besides their publicly released pictures, also have Special pictures that are released privately or given specially to friends. The second source arises because some scanners are unwilling or unable to make a CSV, so the CSV for their picture sets is often compiled by veteran collectors. In that case there are often several versions of the CSV for one set, with considerable differences between versions.
In that case the CSV is not only the basis for verifying authenticity, it also gives the collector hope: no matter how many pictures are missing, and no matter how few people have completed the set with this version of the CSV, there is definitely one person who has them all, namely the author of the CSV! Li Qing, a senior figure in the collecting community, has a famous saying: "If there is a CSV, there must be a picture!" Naturally, some authoritative CSV makers have emerged from the community, for example the early EDZONE, the still-active OSun, shum, vsam, TE, Paul... and WQC of our COC alliance.
Having said all this, you may ask: CSV files are so useful, but how do you actually use them? Surely you don't have to do it all by hand? Remember the word "software" that kept coming up above? There is specialized software that does this for us. Commonly used programs include The!Checker and The Hunter.
At present, both programs have Simplified Chinese versions. The Chinese version is snowtank from the north