# 處理 Html 標籤 ###### tags: `Java` `Jsoup` `html` [TOC] ## Remove HTML tag 1. 正則表達式 2. 使用 javax.swing.text.html.HTMLEditorKit 3. 使用 Jsoup 框架 ## Get Title From HTML ### Jsoup 獲取HTML文檔 ```=Java import org.jsoup.Jsoup; import org.jsoup.nodes.Document; public class Example{ public static void main( String[] args ) { Document document = Jsoup.connect("https://www.google.com").get(); System.out.println("title: " + document.title()); } } ``` #### :triangular_flag_on_post: 請求的網站有SSL加密,因而產生錯誤 :::danger javax.net.ssl.SSLHandshakeException: **PKIX path validation failed: java.security.cert.CertPathValidatorException: validity check failed** Caused by: sun.security.validator.ValidatorException: PKIX path validation failed: java.security.cert.CertPathValidatorException: validity check failed Caused by: java.security.cert.CertPathValidatorException: validity check failed Caused by: java.security.cert.CertificateExpiredException: NotAfter: Sat Nov 12 23:59:59 CST 2016 ::: 參考資料: [官方](https://www.javatpoint.com/jsoup-example-print-links-of-an-url) [Jsoup示例:提取給定url的標題](https://www.1ju.org/jsoup/print-title-of-an-url) [Jsoup Get Title From HTML](https://www.w3schools.blog/jsoup-get-title-from-html-example)