Elasticsearch index a file automatically - elasticsearch

I am new to Elasticsearch, so this question might look weird, but is it possible to index a file automatically (i.e. given a file path, Elasticsearch should index its contents automatically)? I have found some open source tools like elasticdump and tried using them for this purpose, but I would prefer an Elasticsearch plugin that supports almost all Elasticsearch versions. Can anyone suggest one?

Related

Kibana geojson upload hangs on creating index

When uploading a geojson file into a map, Kibana hangs indefinitely on "Writing to index".
It's the same issue as in this fixed bug: https://github.com/elastic/kibana/issues/40102
I have Kibana v7.6.1, so it should be fixed in this version.
The index I want to use is one of these (I think this is the website I got it from): https://datamillnorth.org/dataset/ons-ward-boundaries
The geojson file is only 1.7 MB. I've had the same problem with all the other geojson files I've tried. I haven't successfully uploaded any geojson files.
If I look in Index Management, the index is present, but it doesn't come up as an option when I make a new map and try to add an index layer.
Bug report on Kibana: https://github.com/elastic/kibana/issues/61794
It relates to Elasticsearch, and it happened because the file I was using was bad.
I used the mapshaper CLI to fix the file: https://github.com/mbloch/mapshaper/wiki/Introduction-to-the-Command-Line-Tool
mapshaper -i leeds.geojson -clean -o leeds_fixed.geojson
This originally didn't work with the file I had. I got a different version of the file from a different website (https://martinjc.github.io/UK-GeoJSON/), ran the above command on it, and that fixed the problem in Kibana.

Run Elastic Search on pdf and ppts

I am new to Elasticsearch. I have read its tutorials, but I need guidance on my problem:
I have a collection of PDF documents and PowerPoint files on my system. I need to build a system using Elasticsearch where I can retrieve these files on the basis of keywords present in them. Can someone please guide me on how to proceed and index my documents? Do I need to parse my PDFs and convert them to JSON format using Tika or FSCrawler and then provide that to Elasticsearch?
Thank you.
You should set up FSCrawler; it will do the parsing and make the files' content searchable.

Elastic search next steps

I'm new to Elasticsearch and am still trying to set it up. I have installed Elasticsearch 5.5.1 using default values, and I have also installed Kibana 5.5.1 using the default values. I've also installed the ingest-attachment plugin with the latest x-pack plugin. I have Elasticsearch running as a service and I have Kibana open in my browser. On the Kibana dashboard I have an error stating that it is unable to fetch mappings. I guess this is because I haven't set up any indices or pipelines yet. This is where I need some steer, as all the documentation I've found online so far isn't particularly clear. I have a directory with a mixture of document types such as PDF and DOC files. My ultimate goal is to be able to search these documents with values that a user will enter via an app. I'm guessing I next need to use the Dev Tools/Console window in Kibana with the 'PUT' command to create a pipeline, but I'm unsure how to do this so that it points to my directory with the documents. Can anybody provide me with an example of this for this version, please?
If I understand you correctly, let's first set some basic understanding about elasticsearch:
Elasticsearch, in its simplest definition, is a "search engine". You store some data in it, and then it helps you search that data using search criteria and retrieves the relevant results.
You need a "container" to save your data to. Like any database engine, Elasticsearch has one, but the terms are somewhat different: for example, a "database" in SQL-like systems is called an "index", and what you know as a "table" is called a "type" in Elasticsearch.
From my understanding, you will need to create your index (with or without mappings) to have a starting point. I recommend starting without mappings just to get things working, but later on it's highly recommended to define mappings where applicable, because Elasticsearch is smart, but it cannot know more about your data than you do.
Because Kibana failed to find a proper index to start with, it complained and asked you to provide either an index name pattern or a specific index name, so that it can infer the mappings and give you the nice features for querying, displaying charts, etc. of your data. Once you create your index, you provide its name on the starting page of Kibana, and you will be ready to go.
Let me know if you need something more specific to your needs :)
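To make the pipeline step asked about above more concrete, here is a minimal Python sketch of what a client could send when the ingest-attachment plugin is installed. Note that Elasticsearch does not read a directory by itself; a client has to read each file and send its content base64-encoded. The node URL, the pipeline name "attachment", the index name "docs", the type name "doc" and the field names "data" and "filename" are all assumptions chosen for illustration, and the URL layout follows the 5.x index/type/id form.

```python
import base64
import json
import urllib.request

ES_URL = "http://localhost:9200"  # assumption: a default local node


def pipeline_body():
    """Body for PUT /_ingest/pipeline/<name>: run the attachment processor on 'data'."""
    return {
        "description": "Extract text from base64-encoded files",
        "processors": [{"attachment": {"field": "data"}}],
    }


def document_body(raw_bytes, filename):
    """Body for one document; the attachment processor expects base64 content."""
    return {
        "filename": filename,  # hypothetical extra field, kept for display
        "data": base64.b64encode(raw_bytes).decode("ascii"),
    }


def put_json(path, body):
    """Send a JSON body with HTTP PUT to the node and return the parsed reply."""
    req = urllib.request.Request(
        ES_URL + path,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def index_file(path, doc_id, index="docs", doc_type="doc", pipeline="attachment"):
    """Read one file from disk and index it through the ingest pipeline."""
    with open(path, "rb") as fh:
        body = document_body(fh.read(), path)
    return put_json("/%s/%s/%s?pipeline=%s" % (index, doc_type, doc_id, pipeline), body)
```

With a node running, one would first call put_json("/_ingest/pipeline/attachment", pipeline_body()) once, and then index_file(...) for each file in the directory. The same two request bodies can also be pasted into the Kibana console by hand.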

Mapper Attachment to Kibana issues

I have created some indices in Elasticsearch with the mapper attachment plugin. However, when I try to create an index pattern in Kibana, I cannot find any of the data I created in Elasticsearch, so I cannot build a dashboard in Kibana.
Is there any way to resolve this issue?
Try running http://:9200/_cat/indices?v
The above will return all the indices you have. Once you have verified that your mapper attachment index is there, go to the Settings tab in Kibana and select the checkbox that says your index does not contain time series data. Now enter your index name, and I hope you will find it. Also, make sure your Kibana is configured to point to the Elasticsearch server where your index resides. This is configured in config/kibana.yml.
Hope I have managed to help!
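The index check described above can also be scripted. The following is a minimal Python sketch that reads the plain-text table returned by _cat/indices?v and looks for an index by name. The base URL argument, the index name "attachments" and the sample table are made up for illustration, and the parser assumes every column has a value on every row (which may not hold for closed indices).

```python
import urllib.request


def fetch_cat_indices(base_url):
    """Fetch the plain-text table from <base_url>/_cat/indices?v."""
    with urllib.request.urlopen(base_url + "/_cat/indices?v") as resp:
        return resp.read().decode("utf-8")


def parse_cat_indices(text):
    """Turn the whitespace-separated table into a list of dicts keyed by header."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    if not lines:
        return []
    header = lines[0].split()
    return [dict(zip(header, ln.split())) for ln in lines[1:]]


def has_index(text, name):
    """Return True if an index with this exact name appears in the table."""
    return any(row.get("index") == name for row in parse_cat_indices(text))


# Made-up sample output, only to show the expected shape of the table.
SAMPLE = (
    "health status index       pri rep docs.count docs.deleted store.size pri.store.size\n"
    "yellow open   attachments   5   1         12            0      1.2mb          1.2mb\n"
)
```

Against a live node, has_index(fetch_cat_indices("http://localhost:9200"), "attachments") would answer whether the index exists, where the host and port are again assumptions.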

files indexing automatically by elasticsearch

I am a newbie in Elasticsearch, so please forgive me if my question sounds weird :D
I want to index files in some directories with Elasticsearch automatically (for example, if I add a file to a certain directory, Elasticsearch should index that file immediately), but I don't know how to configure Elasticsearch to do that.
Can anyone suggest a solution?
Thanks in advance
I don't think you can have Elasticsearch watch a directory (and I wouldn't think that is a good thing to do in most cases).
Instead, have a client wrapper that implements a FileWatcher, and push changes to Elasticsearch via this client.
You could use PathHierarchyTokenizer to preserve the file system hierarchy in your index, allowing you to drill down through your directory structure.
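As a rough illustration of the client-side watcher idea, here is a minimal Python sketch that polls a directory and reports new or modified files. It uses simple polling from the standard library rather than any particular FileWatcher API, and the on_change callback is a hypothetical hook where the client would send the file to Elasticsearch.

```python
import os
import time


def snapshot(directory):
    """Map every file under the directory to its last-modified time."""
    state = {}
    for root, _dirs, files in os.walk(directory):
        for name in files:
            path = os.path.join(root, name)
            try:
                state[path] = os.stat(path).st_mtime
            except OSError:
                pass  # the file vanished between listing and stat
    return state


def changed_files(previous, current):
    """Return the sorted paths that are new or whose modification time differs."""
    return sorted(p for p, mtime in current.items() if previous.get(p) != mtime)


def watch(directory, on_change, interval=2.0, max_rounds=None):
    """Poll the directory and call on_change(path) for each new or modified file."""
    previous = snapshot(directory)
    rounds = 0
    while max_rounds is None or rounds < max_rounds:
        time.sleep(interval)
        current = snapshot(directory)
        for path in changed_files(previous, current):
            on_change(path)  # e.g. read the file and send an index request
        previous = current
        rounds += 1
```

Polling keeps the sketch dependency-free; it does not detect deletions, and a real client would likely want to handle those and batch its index requests.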
