Skip to content

Commit 6b37c66

Browse files
committed
fixed little bug
1 parent 06e3c11 commit 6b37c66

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed

source/_posts/itext_using_learning.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -70,6 +70,7 @@ some code
7070
```
7171
* summery
7272
比如要提取1.1.1标题和desc,及对应的代码(**各部分对应的字体是不一样的**,如果字体是一样的,这个文档的排版就太差了)。进行翻译时,code部分内容是不翻译的,如果按照基本的提取方式,把所有的文档提取出来,那么代码的正常的文字就混在了一起,没有办法调用Google翻译了。
73+
7374
## 4 思路
7475
为弄清楚要怎么提取这样文本特征,研究了下,基本例子中内容的提取流程。
7576
PdfTextExtractor.getTextFromPage(pdfReader, i)函数的定义如下:

source/_posts/phantomjs-sso-login.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title:PhantomJS SSO单点登录,加Cookie异常情况处理
2+
title: PhantomJS SSO单点登录,加Cookie异常情况处理
33
date: 2017-12-13 15:36:44
44
tags:
55
- 网络爬虫

0 commit comments

Comments
 (0)
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy