Skip to content

Commit 85b545b

Browse files
authored
Update itext_using_learning.md
1 parent 6c70723 commit 85b545b

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

source/_posts/itext_using_learning.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -59,16 +59,16 @@ public class App
5959
```
6060
## 3 想要提取的文档类型
6161
想要提取的文档如下:
62-
1 目录
63-
1.1 概述
64-
1.1.1 aaaa
62+
1 目录
63+
1.1 概述
64+
1.1.1 aaaa
6565
* desc:
6666
* * this is a test document!
6767
```
6868
#!/bin/bash
6969
some code
7070
```
71-
* summery
71+
* summery
7272
比如要提取1.1.1标题和desc,及对应的代码(**各部分对应的字体是不一样的**,如果字体是一样的,这个文档的排版就太差了)。进行翻译时,code部分内容是不翻译的,如果按照基本的提取方式,把所有的文档提取出来,那么代码的正常的文字就混在了一起,没有办法调用Google翻译了。
7373
## 4 思路
7474
为弄清楚要怎么提取这样文本特征,研究了下,基本例子中内容的提取流程。

0 commit comments

Comments
 (0)
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy