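I. Technical Background and Goals

A Wiki system usually exports its documentation as HTML, with the content spread across many files, one HTML file per page. Microsoft Word, however, imports only a single HTML file at a time. To convert a Wiki export into a format Word can read, the scattered HTML files therefore have to be merged into one HTML file. This involves parsing the HTML files, extracting their content, reorganizing the structure, and adjusting styles.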
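
The merging idea can be illustrated with a minimal sketch that uses only the standard library. The class name `MergeHtmlSketch`, the `page` CSS class, and the sample pages are assumptions made for illustration, and the regular expression is a deliberately crude stand-in for a real HTML parser such as the jsoup library the article's code relies on.

```java
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Sketch: merge the body content of several pages into one HTML document. */
final class MergeHtmlSketch {

    // Crude body matcher; a real HTML parser is more robust than a regex.
    private static final Pattern BODY =
            Pattern.compile("<body[^>]*>(.*)</body>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    /** Returns the content between the body tags, or the trimmed input if none are found. */
    static String extractBody(String html) {
        Matcher m = BODY.matcher(html);
        return m.find() ? m.group(1).trim() : html.trim();
    }

    /** Wraps each page's body in its own div inside a single HTML document. */
    static String merge(List<String> pages) {
        StringBuilder out = new StringBuilder();
        out.append("<html><head><meta charset=\"UTF-8\"></head><body>\n");
        for (String page : pages) {
            out.append("<div class=\"page\">").append(extractBody(page)).append("</div>\n");
        }
        out.append("</body></html>\n");
        return out.toString();
    }

    public static void main(String[] args) {
        // Hypothetical sample pages standing in for exported Wiki files.
        String merged = merge(List.of(
                "<html><body><h1>Page A</h1></body></html>",
                "<html><BODY class=\"x\"><p>Page B</p></BODY></html>"));
        System.out.print(merged);
    }
}
```

Wrapping each page in its own container keeps page boundaries visible in the merged file, which may help when styles are adjusted later.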

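II. Code Logic and Key Technical Points

(1) WikiToHtml: Generating a Single HTML File

The core job of the WikiToHtml class is to merge the scattered Wiki HTML files into one HTML file and to generate a navigation structure so the result can be browsed in Word.
1. Cleanup and initialization
Before any HTML file is processed, the target directory is cleaned so that old files cannot cause conflicts. The clearn method does this by checking whether each file exists and deleting it if so: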

```java
public static void clearn() {
    File filehhc = new File(hhcurl);
    if (filehhc.exists()) {
        filehhc.delete();
    }
    // Cleanup logic for the other files...
}
```
This ensures that the target directory is clean on every run, so stale files from an earlier run cannot interfere.
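
As a variation on the same cleanup step, the sketch below uses `java.nio.file.Files.deleteIfExists`, which combines the existence check and the deletion and throws an exception on failure, whereas `File.delete()` only returns false. The class name `CleanTargetsSketch` and the file names are hypothetical; this is one possible approach, not the article's implementation.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

/** Sketch: remove stale output files before a new run. */
final class CleanTargetsSketch {

    /** Deletes each path if it exists and returns how many files were actually removed. */
    static int clean(List<Path> targets) {
        int removed = 0;
        for (Path target : targets) {
            try {
                if (Files.deleteIfExists(target)) {
                    removed++;
                }
            } catch (IOException e) {
                // Surface the failure instead of silently ignoring it.
                throw new UncheckedIOException("Could not delete " + target, e);
            }
        }
        return removed;
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical file names inside a temporary directory.
        Path dir = Files.createTempDirectory("wiki-out");
        Path stale = Files.createFile(dir.resolve("index.hhc"));
        Path missing = dir.resolve("missing.html");
        System.out.println(clean(List.of(stale, missing))); // prints 1
    }
}
```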
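2. File traversal and path collection
The getFile method recursively walks the given directory to collect the paths of the HTML files, appending each canonical path to a StringBuffer, one per line. As written, the code appends every file it finds and does not filter by extension: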

```java
public static void getFile(File file, StringBuffer str) {
    File[] fileitem = file.listFiles();
    if (fileitem == null) {
        // listFiles() returns null if the path is not a directory or cannot be read
        return;
    }
    for (int i = 0; i < fileitem.length; i++) {
        if (fileitem[i].isDirectory()) {
            // Recurse into subdirectories
            getFile(fileitem[i], str);
        } else {
            try {
                str.append(fileitem[i].getCanonicalPath() + "\r\n");
            } catch (IOException e) {
                e.printStackTrace();
                if (log != null) {
                    log.addLog(e.getMessage());
                }
            }
        }
    }
}
```
By recursing into subdirectories, the method ensures that every file path is collected, which gives the later processing steps their input.
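
The same traversal can be sketched with `java.nio.file.Files.walk`, which handles the recursion itself. Unlike the getFile method described above, this version keeps only files whose names end in `.html` and returns a sorted list instead of filling a StringBuffer. The class name `CollectHtmlSketch` and the sample directory layout are assumptions for illustration.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Sketch: recursively collect the HTML files under a directory. */
final class CollectHtmlSketch {

    /** Returns all regular files below root whose names end in .html, in sorted order. */
    static List<Path> collectHtml(Path root) throws IOException {
        // The stream holds open directory handles, so close it with try-with-resources.
        try (Stream<Path> paths = Files.walk(root)) {
            return paths
                    .filter(Files::isRegularFile)
                    .filter(p -> p.getFileName().toString()
                            .toLowerCase(Locale.ROOT).endsWith(".html"))
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical layout: one page at the top level, one in a subdirectory.
        Path root = Files.createTempDirectory("wiki-export");
        Files.createFile(root.resolve("index.html"));
        Path sub = Files.createDirectory(root.resolve("pages"));
        Files.createFile(sub.resolve("intro.html"));
        Files.createFile(sub.resolve("logo.png"));
        collectHtml(root).forEach(System.out::println);
    }
}
```

Sorting the result makes the page order reproducible between runs, which matters once the pages are concatenated into one document.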
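3. Merging the HTML content
The modifyHtml method processes each HTML file, removing elements that are not needed (such as headers, footers, and attachments) and adjusting styles to suit how Word displays the page: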

```java
public static void modifyHtml() {
    File homeFile = new File(home);
    File[] htmlFile = homeFile.listFiles(new FilenameFilter() {
        public boolean accept(File dir, String name) {
            if (name.lastIndexOf(".html") != -1) {
                return true;
            } else {
                return false;
            }
        }
    });
    for (int i = 0; i < htmlFile.length; i++) {
        File html = htmlFile[i];
        Document doc = Jsoup.parse(html, "UTF-8", …
```
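
One detail of the filter in modifyHtml is worth noting: `name.lastIndexOf(".html") != -1` accepts any name that contains `.html` anywhere, including names such as `page.html.bak`. A stricter filter, sketched here under the assumption that only files ending in `.html` should be processed, could look like the following. The name `HtmlFilterSketch` is hypothetical.

```java
import java.io.File;
import java.io.FilenameFilter;
import java.util.Locale;

/** Sketch: accept only names that end in .html, ignoring case. */
final class HtmlFilterSketch {

    static final FilenameFilter HTML_ONLY =
            (dir, name) -> name.toLowerCase(Locale.ROOT).endsWith(".html");

    public static void main(String[] args) {
        File dir = new File(".");
        System.out.println(HTML_ONLY.accept(dir, "page.html"));     // prints true
        System.out.println(HTML_ONLY.accept(dir, "page.html.bak")); // prints false
    }
}
```

The filter can be passed to `File.listFiles(FilenameFilter)` in the same way as the anonymous class in the original method.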