Updated release with minor improvements to functions to read in Microsoft Word and PowerPoint files.
Components on PowerPoint slides are stored in a named list to preserve structure. Tables on PowerPoint slides are now detected and extracted as character matrices.
File is read in, broken by XML defined paragraph and returned as a vector.
File is read in, each slide is processed and returned as an element of a list. Each slide has most components identified (titles, subtitles, text blocks, shapes, tables) and extracts the text. This text is returned as either a data.frame or a matrix (for tables) with minor formating details provided. This text is stored in a named list (names are the slide component names).
Need a high-speed mirror for your open-source project?
Contact our mirror admin team at info@clientvps.com.
This archive is provided as a free public service to the community.
Proudly supported by infrastructure from VPSPulse , RxServers , BuyNumber , UnitVPS , OffshoreName and secure payment technology by ArionPay.