From Wikisource
Digitising texts and images for Wikisource

Shortcut:
H:SCAN

The material on Wikisource should ideally be proofread from a scan of the original, physical text: a book, magazine, newspaper, etc. The first step in this process is therefore to scan and digitise the text. If no existing scan is available,[1] a Wikisource volunteer will need to make one. These instructions refer to scanning a book, but apply equally to other print media.

Wikisource:Scan Lab can provide assistance with creating or repairing scans.

First steps

This help page assumes that you have access to a complete copy of the original work and that you have checked its copyright status to ensure that it is lawful to scan and upload the work to Wikisource. If you have not already done so, please check this now; otherwise the end product of all your efforts may be barred from being hosted here for failing to comply with copyright law or with Wikisource policy and practice. This process is described in Help:Adding texts and Help:Adding images.

Scanning

Scanning works can be done in one of several different ways, using different equipment.

The scanning of bound books can be difficult due to the binding. A book is an irregularly-shaped object and does not fit neatly into normal scanning devices. Care must also be taken not to damage books in the process of scanning them, unless destructive scanning is used.

V-cradle scanners

An Internet Archive book scanner with a V-shaped cradle.

The best means of scanning a book is a special scanner with a V-shaped cradle. This supports the book in a natural reading position, keeping the pages flat without damaging the book's spine. It is also very fast, as pages can simply be flipped as normal. Commercial book scanners of this type can be very expensive. Amateur and custom-made versions can be much cheaper, but need to be built from scratch.

A DIY version involves a cradle and one or two digital cameras to take the individual page scans. The cradle can be made of any material, from cardboard to wood to metal; it must hold the book open at a 90° angle, with each side of the book at 45° to the vertical. The camera must be pointed directly at the page and aligned properly; if not, the scan will appear skewed. Depending on the size of the book, the cradle may need to be adjustable to maintain the same angle with the camera as the scan proceeds (the thickness of the book will transfer from one side to the other, altering the centre position of the pages with respect to the cradle and the camera and causing gradual skewing of the output). A glass pane (either bought specially or adapted from a common picture frame) will be necessary to hold the pages flat during scanning. Lighting should be diffuse so that the pages are evenly lit. While the human eye can adjust to different levels of lighting, uneven lighting will be especially noticeable to computer software at the processing stage and will interfere with optical character recognition. Direct lighting may also cause glare on the glass pane holding the pages flat.

Flat-bed scanners

Plustek book scanner, where the scanning area continues to the edge of the device.
Bookeye 3 overhead book scanner.

Flat-bed scanners are not as good as a V-cradle scanner for scanning a book, but they are the next best choice, and several versions exist. These devices can also be expensive to purchase.

One version is a special flat-bed device where the scanning area extends to the very edge of the device, allowing one side of a book to be laid flat with the hinge side of the binding at the outside edge of the scanner.[2]

The other version is an overhead scanner. The book is laid open beneath the scanner, which takes an image of either one page or both visible pages together. There will be some distortion in the scan towards the hinge of the book, as the pages bend inwards rather than lie uniformly flat.

Alternatively, instead of using a special flat-bed book scanner, books can be pressed against a standard flat-bed scanner. The same distortion described for the overhead scanner will also occur with this method. Pressing a book flat in this manner may also damage the book and its binding.

Flat-bed scanners have a limited scanning area due to the size of the machine. These devices are usually in A4 format and will take up to a quarto (approx 10in x 8in) book page size. Bigger pages than that need an A3 scanner. An alternative is to use a photocopier to reduce bigger pages to A4 format and scan the photocopies.
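
The size check and the photocopier reduction described above come down to simple arithmetic. The following is a minimal sketch, assuming the scanner bed matches the nominal ISO paper size exactly (real beds can differ slightly); the function names are illustrative only.

```python
# Nominal ISO 216 paper sizes in millimetres.
A4_MM = (210.0, 297.0)
A3_MM = (297.0, 420.0)
MM_PER_INCH = 25.4


def fits(page_mm, bed_mm):
    """Return True if the page fits on the bed in either orientation."""
    (pw, ph), (bw, bh) = sorted(page_mm), sorted(bed_mm)
    return pw <= bw and ph <= bh


def reduction_to_fit(page_mm, target_mm=A4_MM):
    """Return the largest scale factor (at most 1.0) at which the page fits the target."""
    (pw, ph), (tw, th) = sorted(page_mm), sorted(target_mm)
    return min(1.0, tw / pw, th / ph)


# A quarto page of roughly 10 in x 8 in, converted to millimetres.
quarto_mm = (8 * MM_PER_INCH, 10 * MM_PER_INCH)
print(fits(quarto_mm, A4_MM))        # quarto on an A4 bed
print(fits(A3_MM, A4_MM))            # A3 page on an A4 bed
print(round(reduction_to_fit(A3_MM), 3))  # photocopier reduction for A3 to A4
```

A reduction factor below 1.0 is the setting to apply on the photocopier before scanning the copies; bear in mind that reducing a page also reduces the effective resolution of the final scan.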

Photocopiers and multi-functional devices

Some modern office equipment includes a scanning function and processing software. The limitations described for flat-bed scanners apply here as well.

Digital cameras

Although not as reliably high quality as scanning, simply taking photos of documents is a perfectly viable means of digitising. It's generally quicker and easier, especially as a camera is often permitted or possible where a scanner would not be. For an example of a document prepared using direct hand-held photography, see Base Facilities Report.

NB: If using a tripod, monopod or other stand, the function of a v-cradle or overhead flat-bed book scanner can be replicated using a normal digital camera.

Destructive scanning

Destructive scanning is not recommended, but it should be mentioned for completeness. This method avoids the problems in scanning irregularly shaped books as described above. As the name implies, this destroys the physical book as part of the scanning.

Destructive scanning means taking the book apart. This may involve cutting the pages free from the binding or removing staples, stitching or other parts of the book. The result will be a stack of loose pages instead of a bound book. These pages can then be laid flat on a scanning device and can even make use of an automatic feeder.

This is faster and easier than any other form of scanning, but, again, it will destroy the book.

Processing

Once you have your scans, you will need to process them into a single file. A scanned text should be a single file in a container file format, either DjVu or PDF. Some scanners may be able to output your scans directly in one of these formats. Many, however, will produce a series of individual page scans, probably in JPEG or JPEG2000 format. These need to be converted to the container format.

Pre-processing

Before creating the single file, it is a good idea to make copies of, or otherwise set aside, any page scan with an illustration or other image. These will need to be extracted and uploaded separately so they can be added to the final work during proofreading. Images should be extracted from the raw, unprocessed scans whenever possible. Any processing may result in lower image quality, especially if certain image file formats are repeatedly saved and compressed. Additionally, the process of combining page scans into a single file involves some compression; PDF uses less compression than DjVu, but either will result in slightly inferior image quality. Images should therefore not be extracted from the single file unless no other option exists. The original images are likely to be the best quality available to you.

Before creating the single file you may wish to alter the page images. Depending on the scanning method and circumstances, some or all pages may be skewed. They may need to be rotated, cropped, deskewed or otherwise manipulated. If the scans combine two pages into one image file they will need to be split into separate files. The goal is for each page scan to be an accurate image of a single, flat page from the original work.

Individual scans may need to be renamed. Some processing requires that the pages are in the correct order when sorted alphabetically. Using a filename such as "Name000", where Name is an identifier for the work and 000 is an incrementing page number, is a common way of achieving this. Some scanning methods may produce two sets of scans, separating the scans of the left- and right-hand pages, which will need to be recombined. In this case, it is best to rename each set separately, using an increment of 2 for each, so that when they are copied into the same folder they are already in the correct order. (If one set was scanned from the back of the book to the front, it will need to start at the highest page number and increment by -2 in order to fit with the other set.) The program IrfanView can perform batch renaming.
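
The increment-of-2 scheme can also be scripted. The following is a minimal sketch using only the Python standard library; the function names, the ".jpg" extension and the three-digit padding are assumptions chosen for illustration, and each set is assumed to sit in its own folder with files that already sort in the order they were scanned.

```python
from pathlib import Path


def interleaved_names(count, first_page, work="Name", step=2, ext=".jpg", width=3):
    """Return new filenames for one set of scans, in the order they were scanned.

    Use a negative step for a set scanned from the back of the book to the front.
    """
    return [f"{work}{first_page + i * step:0{width}d}{ext}" for i in range(count)]


def rename_set(folder, new_names):
    """Rename the files in `folder`, taken in sorted order, to `new_names`."""
    files = sorted(p for p in Path(folder).iterdir() if p.is_file())
    if len(files) != len(new_names):
        raise ValueError("number of files and number of names differ")
    for path, name in zip(files, new_names):
        path.rename(path.with_name(name))


# One set scanned front to back, the other scanned back to front.
set_a = interleaved_names(3, first_page=1)
set_b = interleaved_names(3, first_page=6, step=-2)
print(set_a)
print(set_b)
print(sorted(set_a + set_b))  # the order the merged folder will sort into
```

`rename_set` does not guard against a new name colliding with a file that has not yet been renamed, so it is safest to run it on a copy of the scans whose current names do not already follow the "Name000" pattern.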

Some people choose to desaturate page images, creating a black and white image instead of colour. This is not recommended. It will reduce the final file size, but this is no longer an important consideration with modern technology. Colour page scans contain more information than monochrome versions; for example, brown stains and other discolouration over black text may leave the text legible in colour but completely obscure it in black and white.

File creation

The easiest method of processing scans is to upload them to the Internet Archive, which will perform this operation for you. To do this, create a zip file containing the single-page scans and upload it to the Internet Archive. After some time, a PDF file with an OCR text layer will be generated.

See Help:DjVu files#The Internet Archive for details.
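
Creating the zip file can be done with any archiving tool. As one possibility, the following is a minimal sketch using Python's standard `zipfile` module; the function name and the list of image extensions are assumptions for illustration, and the upload itself is not covered here.

```python
import zipfile
from pathlib import Path


def zip_page_scans(scan_dir, zip_path,
                   suffixes=(".jpg", ".jpeg", ".jp2", ".png", ".tif", ".tiff")):
    """Write every page image in scan_dir into zip_path, in filename order.

    Returns the list of filenames that were added.
    """
    pages = sorted(
        p for p in Path(scan_dir).iterdir()
        if p.is_file() and p.suffix.lower() in suffixes
    )
    # Image files are already compressed, so they are stored without
    # further compression.
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_STORED) as archive:
        for page in pages:
            # arcname keeps the archive flat: only the filename, no folders.
            archive.write(page, arcname=page.name)
    return [p.name for p in pages]
```

For example, `zip_page_scans("scans", "Name.zip")` would collect the images from a hypothetical `scans` folder. Because the files are added in sorted filename order, the page naming scheme described under pre-processing determines the page order inside the archive.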

Images and illustrations

All illustrations and other images from your work need separate image files. They cannot be transferred directly from the scan file to the finished proofread transcription.

You should have saved your original page scans or set aside those with illustrations during the pre-processing stage. The images need to be extracted from these into a form usable by Wikisource and any re-users of our works. This will at least involve cropping the pages, but may require more processing (including but not limited to rotation, de-skewing, colour and level adjustment, addition of an alpha channel (transparency) and more). The free image processing software GIMP is useful for this.

When saving, please choose the most appropriate format for each individual image. The JPEG file format is best for photographs and detailed colour illustrations. The PNG file format is best for diagrams or simple monochrome illustrations.

Uploading

Once you have created your file and extracted any illustrations, they should be uploaded to Wikimedia Commons.

If you have used a website to process the scan, and it has a stable URL, you can transfer it directly to Commons using the URL2Commons tool (see Help:URL2Commons). Otherwise you will need to download the file to your computer first and upload it the normal way. In the latter case, or if you created the file yourself, you should follow the normal uploading instructions at Wikimedia Commons.

If there is more than one file involved (for example, if there are illustrations) it can be useful to create a special category for the files. This should hold the scan and all related files in one place. This will be useful for you, or anyone else, when finding a specific file and for administrative purposes such as movement, renaming and recategorisation.

In a few cases, Wikimedia Commons will not accept some files. This is due to additional policies at Commons that go beyond the minimum legal requirement (works still under copyright in their home country but in the public domain in the United States are not allowed on Commons). If this is the case, the files can be uploaded directly to Wikisource. All other advice still applies.

Notes

  1. Scans can often be found on sites such as the Internet Archive or Google Books.
  2. The Plustek OpticBook 3600 is an example of a special flat-bed scanner for book scanning.

See also

  • Awesome Scanning: "A curated list of awesome projects to simplify and improve paper scanning"
