Posts Topics Forums Images
Search videos from message boards Videos Search messages from microblogs Microblogs Search messages from imdb.com Imdb Search messages from yuku.com Yuku Search messages from lefora.com (free forums) Lefora
My account: Login | Sign Up
Loading... 

Thread: How to crawl image files like jpeg, gif, ...

Started 1 year, 10 months ago by develoWorks
Hello, I want to index and find image files. I`ve already included the entries '.jpg' and '.jpeg' in the crawl options menu of the concerning data source. Furthermore, I`ve changed the file '/WEB-INF/config.properties'. I added the file type name 'jpg' with the file type extension 'jpg image/jpeg'. Nonetheless, the files can`t be found when using the enterprise search, ...
Site: developerWorks : Information Management Forums  developerWorks : Information Management Forums - site profile
Forum: IBM OmniFind Enterprise Edition   IBM OmniFind Enterprise Edition
 - forum profile
Total authors: 5 authors
Total thread posts: 7 posts
Thread activity: no new posts during last week
Domain info for: ibm.com

Other posts in this thread:

damorris replied 1 year, 10 months ago
OmniFind doesn't index image files and I honestly have no idea why you would want to do so in an enterprise search arena. What would you search on? The only place where I can see applicability as far as indexing an "image" file, is for scanned document images. You could then use OCR (optical character recognition) to extract the text from the image. Other than that, the only other...

develoWorks replied 1 year, 10 months ago
It was planned to search the image files according to their metadata. It`s a pity that Omnifind is not able to index such files. But anyway, thank you for your quick answer. Message was edited by: develoWorks

spriye replied 1 year, 4 months ago
I am surprised when you say that Omnifind doesnt crawl image files. Inspite of my mimetypes set to exclude image/tiff it is still being picked up. Is it doing so because it is trying to crawl based off the metadata associated with those images. I have tried every possible scenario to limit the crawl space so that the image files dont get crawled and yet they are. I have posted couple ...

Boudy replied 1 month, 2 weeks ago
Hello Dmorris, As you stated in your post "the only other way it makes sense is if the images are in a content repository and have metadata associated with them in which case you would index the metadata", is that do-able OR NOT ?? if i have a content Repository (FileNet P8 4.5) and i have image files uploaded to this repository, can i index these files using omnifind OR not ??...

mauriziog replied 1 month, 2 weeks ago
image_indexing.zip (1.5 KB)

Boudy replied 1 month, 2 weeks ago
Hey there mauriziog, I tried the approach that you specified and it worked fine (with some abnormalities, like if you try to search with the file name it doesnt return any results, but if you search with the extension jpg it gets them all) Next I will try it with filenet (if we can ever connect the IICE to filenet :S) Thanks alot for the fast reply

 

Top contributing authors

Name
Posts
Boudy
2
user's latest post:
How to crawl image files like...
Published (2009-11-09 08:20:00)
Hey there mauriziog, I tried the approach that you specified and it worked fine (with some abnormalities, like if you try to search with the file name it doesnt return any results, but if you search with the extension jpg it gets them all) Next I will try it with filenet (if we can ever connect the IICE to filenet :S) Thanks alot for the fast reply
develoWorks
2
user's latest post:
How to crawl image files like...
Published (2008-02-05 09:26:00)
It was planned to search the image files according to their metadata. It`s a pity that Omnifind is not able to index such files. But anyway, thank you for your quick answer. Message was edited by: develoWorks
spriye
1
user's latest post:
How to crawl image files like...
Published (2008-08-05 18:11:00)
I am surprised when you say that Omnifind doesnt crawl image files. Inspite of my mimetypes set to exclude image/tiff it is still being picked up. Is it doing so because it is trying to crawl based off the metadata associated with those images. I have tried every possible scenario to limit the crawl space so that the image files dont get crawled and yet they are. I have posted couple of threads on this but to no response. Do you have any ideas?
mauriziog
1
user's latest post:
How to crawl image files like...
Published (2009-11-08 13:24:00)
image_indexing.zip (1.5 KB)
damorris
1
user's latest post:
How to crawl image files like...
Published (2008-02-05 08:54:00)
OmniFind doesn't index image files and I honestly have no idea why you would want to do so in an enterprise search arena. What would you search on? The only place where I can see applicability as far as indexing an "image" file, is for scanned document images. You could then use OCR (optical character recognition) to extract the text from the image. Other than that, the only other way it makes sense is if the images are in a...

Related threads on "developerWorks : Information Management Forums":

Related threads on other sites:

Thread profile page for "How to crawl image files like jpeg, gif, ..." on http://www.ibm.com/developerworks/db2/. This report page is a snippet summary view from a single thread "How to crawl image files like jpeg, gif, ...", located on the Message Board at http://www.ibm.com/developerworks/db2/. This thread profile page shows the thread statistics for: Total Authors, Total Thread Posts, and Thread Activity