
Topic: Create automatic index from text document

10.03% popularity

Is there some software which can automatically do a basic index from my Text document (MS Word) for me?

I have written multiple speaker texts (each 2-3 MS Word pages, in German) for videos I created (about 25 videos).

Now I want to automatically generate a basic index of each text so people can roughly find which of the videos matches their search criteria. I am under time pressure and really just need a basic index for now (I will have time later to go after the details).

NOTE: I will also accept workarounds if this cannot be done, but please suggest something that is at least faster than reading through all the documents again and evaluating every word :(



More posts by @Murphy332

3 Comments


10% popularity

Good luck! Consider a tool better suited to this kind of work; I heartily recommend LaTeX. It's efficient and very powerful. There is a Stack Exchange site for it: tex.stackexchange.com



10% popularity

Keep in mind that you are talking about creating concordances, not subject indexes. Subject indexes cannot be produced automatically; they require human analysis for substance and quality. For quality results, a good search will cover both the text and the human-created index, giving you concordance findings as well as analysis of relations, alternative phrasing, etc., which concordances do not provide.
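To make the concordance idea concrete, here is a minimal sketch in Python. It assumes the text of each document has already been extracted from Word (for example by saving as plain text); the names `build_concordance`, `search`, `STOPWORDS`, and the sample texts are hypothetical and only for illustration. It simply maps every word to the documents that contain it, which is exactly the kind of mechanical listing that does not replace a subject index.

```python
import re
from collections import defaultdict

# Hypothetical, very short stopword list; a real German list would be much longer.
STOPWORDS = {"der", "die", "das", "und", "ist", "ein", "eine", "in", "zu"}


def build_concordance(texts, min_length=3):
    """Map each word to the sorted list of document names that contain it.

    `texts` is a dict of {document name: plain text}.
    """
    index = defaultdict(set)
    for name, text in texts.items():
        # \w+ matches Unicode letters, so German umlauts are handled.
        for word in re.findall(r"\w+", text.lower()):
            if len(word) >= min_length and word not in STOPWORDS:
                index[word].add(name)
    # Sort words alphabetically and document names per word.
    return {word: sorted(names) for word, names in sorted(index.items())}


def search(index, term):
    """Return the documents containing `term` (case-insensitive, exact word)."""
    return index.get(term.lower(), [])


# Made-up sample data standing in for the speaker texts.
texts = {
    "video01": "Die Kamera ist ein Werkzeug.",
    "video02": "Das Licht und die Kamera.",
}
index = build_concordance(texts)

for word, names in index.items():
    print(word, "->", ", ".join(names))
```

Matching is on exact word forms only, so inflected forms, synonyms, and related concepts are not connected; that gap is what human indexing fills.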

For more information, please visit the American Society for Indexing (ASI).

Pilar Wyman
Immediate-Past President, ASI
Chief Indexer, Wyman Indexing



10% popularity

I think I now understand your question. There is indeed a lot of software that can do what you want, but I think your premise is wrong: this is not about creating a table of contents for the document but about indexing the document's content so that, at least as I understand it, complex searches become possible.

Normally this is done through software built not to index but to manage content, for example a CMS (https://en.wikipedia.org/wiki/Content_management_system).

That way, the searching and indexing are performed not by MS Word but by some other system. Please note that the Windows search service and Linux commands allow the same kind of searches, but they only work locally.

Just as an example, I work with eZ Publish (http://ez.no/Products/eZ-Publish-CMS). It uses Solr (http://lucene.apache.org/solr/) to index the content, and the CMS itself performs the searches. Of course, I'm talking about client/server software, which is quite complex, but since you want people to search, I assume you will have some kind of server to share the content.

I guess WordPress (http://wordpress.org/) can do it using plugins, but I'm not sure.

In any case, if you want to share searches and information, I think your premise is wrong, since this will not happen at the Word document level but one step higher, in the software that shares and manages the content. Maybe you should talk about this to your editor, the webadmin, or the developer who will share, sell, or distribute your book.

I really hope I wasn't too technical. This is quite hard to explain. Anyway, if this answer addresses your question and you have any more doubts, comment here and I'll try to explain more clearly.

Addendum:
I just remembered something. Google can index PDF files and make them searchable. You can add a custom search box to your site, for example, assuming that's how you want to share the content. Please check www.seoconsult.com/seoblog/seo-techniques/can-google-fully-index-pdf-files.html

