unable to merge content in NIFI using merge content processor - apache-nifi

I have a text file and i need to merge text files into one single file using NIFI
when I am using merge content processor data is appending unable to merge
example :- if my text file has data like this
in file1:- 2201
and in file2:- 2202
after using merge content in nifi I am getting output as 22012202
merge content processor :-
but I need out put as below
2201
2202
2203
can any one help me with this i need to do this in nifi only

Put Delimiter Strategy as Text and set the Demarcator a new-line by entering shift+enter in the demarcator value.

Related

How to use an attribute content into Search Value of Nifi ReplaceTex processor

I am following nifi guide to parse a delimited file content.
Instead of coding the search text and replacement value i want to use the content of two attributes.
processor config
When executed the processor is not using the attribute content as a regexp even if it is a valid regex expression

NiFi - Change Filename (CSV) adding an attribute from a incoming JSON

I am trying to modify the name of some files with NiFi getting a value from a JSON an adding to the original filename (for example filename.csv (original name) + january (name that provides the incoming JSON)). To do so, I am redirecting the CSV files to updateAttribute to change de Filename.
From the other hand, I am receiving a JSON that has an attribute that will be part of the name of the file.
On the evaluateJsonPath the configuration is the next (I am receiving it correctly the value):
And finally I am trying to merge the values in the UpdateAttribute processor (here is where it doesn't work properly):
The response I am getting is _filename.csv
You have quotes around name inside your Expression Language expression, try
${name}_${filename} or
${name:append('_'):append(${filename})}

Split FlowContent by line and extract text to attributes NIFI

Im using NIFI and i want to extract attributes of my file lines .
My Filetext looks like this :
DEV=A9E ,SEN=1
DEV=B9E ,SEN=2
...
And i want to split text by line and then extract dev and sen to attribute , any way to do this with NIFI, i have tried split text and split content but I can't see how can I split text by line.
SplitText with a Line Count of 1 is generally the approach to split a text file line-by-line. It will use \r, \n, or \r\n as the end of a line.
ExtractText would be used to parse each line and extract parts of the line into flow file attributes. You need to come up with a regular expression that uses capture groups to capture the parts you are interested in.

Changing delimiter used by jmeter to save results as CSV

I have a Jmeter script which will fire some HTTP requests. From the requests i am getting the required values to my output.
In the jmeter.properties file i have mentioned as follow:
# For use with Comma-separated value (CSV) files or other formats
# where the fields' values are separated by specified delimiters
# Default:
#jmeter.save.saveservice.default_delimiter=|
# For TAB, since JMeter 2.3 one can use:
#jmeter.save.save-service.default_delimiter=\t
But it is not reflecting in my output.
How to resolve this issue?
I have the same problem. I've added to the jmeter.properties file the following lines:
jmeter.save.saveservice.output_format=csv
jmeter.save.saveservice.default_delimiter=;
csvread.delimiter=;
Now Simple Data Writer saves results using ; to separate fields' values. But when I save data in the Aggregate Report element (by clicking the 'Save Table Data' button) the values in result aggregate.csv file are separated by ,
If you really did what you are describing in the code excerpt, then you forgot to uncomment fields by removing # before the concerned line.
Furthermore you are making a mistake in the property names, they should be:
jmeter.save.saveservice.default_delimiter=|
jmeter.save.saveservice.default_delimiter=\t
For full reference, see jmeter.properties.
But I am not sure to answer your question , as it is not clear though, can you clarify it.
Regards
Philippe M.

Hadoop custom split of TextFile

I have a fairly large text file that I would like to convert into a SequenceFile. Unfortunately, the file consists of Python code with logical lines running over several physical lines. For example,
print "Blah Blah\
... blah blah"
Each logical line is terminated by a NEWLINE. Could someone clarify how I could possibly generate Key, Value pairs in Map-Reduce where each Value is the entire logical line?
I don't find the question asked earlier, but you just have to iterate over your lines via a simple mapreduce job and save them into a StringBuilder. Flush the StringBuilder to the context if you want to begin with a new record. The trick is to setup the StringBuilder in your mappers class as a field and not as a local variable.
here it is:
Processing paraphragraphs in text files as single records with Hadoop
You should create your own variation on TextInputFormat. In there you make a new RecordReader that skips lines until it sees the start of a logical line.
Preprocess the input file to remove the newlines. What is your goal in creating the SequenceFile?

Resources