Google索引的文件类型

发布于:2024-06-03 ⋅ 阅读:(119) ⋅ 点赞:(0)

File types indexable by Google

Google 可以索引大多数文本文件和某些编码文件格式的内容。我们索引的最常见文件类型包括:

Google can index the content of most text-based files and certain encoded document formats. The most common file types we index include:

  • Adobe Portable Document Format (.pdf)

  • Adobe PostScript (.ps)

  • Comma-Separated Values (.csv)

  • Electronic Publication (.epub)

  • Google Earth (.kml, .kmz)

  • GPS eXchange Format (.gpx)

  • Hancom Hanword (.hwp)

  • HTML (.htm, .html, other file extensions)

  • Microsoft Excel (.xls, .xlsx)

  • Microsoft PowerPoint (.ppt, .pptx)

  • Microsoft Word (.doc, .docx)

  • OpenOffice presentation (.odp)

  • OpenOffice spreadsheet (.ods)

  • OpenOffice text (.odt)

  • Rich Text Format (.rtf)

  • Scalable Vector Graphics (.svg)

  • TeX/LaTeX (.tex)

  • Text (.txt, .text, other file extensions), including source code in common programming languages, such as:

    • Basic source code (.bas)

    • C/C++ source code (.c, .cc, .cpp, .cxx, .h, .hpp)

    • C# source code (.cs)

    • Java source code (.java)

    • Perl source code (.pl)

    • Python source code (.py)

  • Wireless Markup Language (.wml, .wap)

  • XML (.xml)

Google can also index the following media formats:

  • Image formats: BMP, GIF, JPEG, PNG, WebP, and SVG

  • Video formats: 3GP, 3G2, ASF, AVI, DivX, M2V, M3U, M3U8, M4V, MKV, MOV, MP4, MPEG, OGV, QVT, RAM, RM, VOB, WebM, WMV, and XAP

参考:

https://developers.google.com/search/docs/crawling-indexing/indexable-file-types