how to eliminate outlier in spotfire box plots - max

Thanks for your help in advance.
Regards,
Raj

Adding the values to MAX() values would skew the data even if it were possible. There are two hacks to do this though.
Right Click > Properties > Y-Axis > set the MIN range value and MAX range values to something that would eliminate all outliers. This is really only suitable for box plots that are close in all values to each other (all percentiles)
On your toolbar click Insert > Calculated Column > choose the correct data table and paste in the expression below. You will need to replace the [x-axisColumn] and the [y-axisColumn] with what ever is on the X and Y axis of your box plot. To find out what this is, right click on your box plot > properties > X-Axis and there it will tell you what your X-Axis column is. Do this for the Y-Axis as well. This will create a column called "Outlier" in your data table with three options: NotOutlier, UpperOutlier, LowerOutlier. You just need to uncheck "UpperOutlier" and "LowerOutlier" to remove your outliers.
`
case
when [y-axisColumn]<(Q1([y-axisColumn]) - (1.5 * IQR([y-axisColumn]) OVER ([x-axisColumn]))) then "LowerOutlier"
when [y-axisColumn]>(Q3([y-axisColumn]) + (1.5 * IQR([y-axisColumn]) OVER ([x-axisColumn]))) then "UpperOutlier"
else "NotOutlier"
end as [Outlier]
`

Related

How do you get data to plot in SQLDeveloper Report Charting in a Split Dual Y Chart?

I am trying to use the Reporting capability in SQL Developer. I can get the charting to work fine except with a split dual-Y chart. How do I write the query and configure the report to get data to plot on both of the charts? Right now I have a query with the series, X value, Y1 Value, and Y2 value, but only the Y1 value plots. I added what I believe is the second value in the "Data" pane by having two rows in the "Mapping" section. Any help is appreciated.

multiple x values spotfire

I am currently trying to adapt an excel dashboard I created a few years back into spotfire. The data previously fit onto one page, but this year the number of metrics has doubled and I thought it would be a good opportunity to try spotfire.
The current dashboard has a number of scatter plot charts showing the values from a range of cells (rank 1-5). Overlaid is another scatter plot that shows the groups results.
I have found it easy enough to run the scatter plot in Excel based on a range of data, but as of right now I am stumped on how to choose a range of values for my spotfire scatters.
Here is an example (in excel) of what I am trying to accomplish. The data example is how my data is currently setup.
Would you have any tips on how I may be able to produce a similar chart in Spotfire?
You can create a Document.Properties with all the values you want. Then create a slider in a text area with this document properties. You will be able to select a range of values with this slider.
To use your range in a scatter plot, on the axis you want, right click and custom expression then write something like $(my_docproperty).
Hope it will a solution to your problem !

Is it possible to create an XY (aka scatter) plot using Kibana 4?

I have a few million documents in an ElasticSearch index with some numeric fields, say foo and bar. Is there any way to use Kibana 4 to create a graph with foo values on the X axis and bar values on the Y axis? Like a very, very basic chart one might create using Excel.
I'm fine with sampling/aggregations of some kind. I understand that these tools won't show me a plot with 20 million data points. I'm just trying to see if there's some obvious relationship between foo and bar by creating a graph.
To just plot the correlation between revenue and employee count I would just use a line chart like this:
In order to justify creating a scatter plot chart though (since they're awesome and I wanted to) I generated some fake data that looked something like this:
{
name: faker.company.companyName(),
employees: _.random(3, 30),
revenue: _.random(10000, 100000),
industry: _.sample(industries)
}
And plotted it in visualize by breaking it down piece-by-piece:
Start with a line chart
Switch to Options tab of sidebar (since 4.1)
Uncheck "Show Connecting Lines"
Check "Scale Y-Axis to Data Bounds"
Switch back to the Data tab
Modify the "Y-Axis"
use the Average aggregation
on the employees field
Add a "Dot Size" metric
use the Unique Count aggregation
on the company field
Add a "Split Lines" bucket
use the Terms aggregation
on the industry field
I like to set the size close to the cardinality of my data
Add an "X-Axis"
use the Histogram aggregation
on the revenue field
guess an interval, you will need to play with this a bit
Finally, click Apply
This configuration is pretty complex, but the resulting visualization shows a lot of information.
I found a hack for this.
Create a line chart
X axis is a Terms aggregation of foo
Add sub-aggregation (Split Lines) on the same field
Y Axis is sum of your other column (bar)
I don't see any way of making the legend meaningful, though

How to slice a data cube in Birt designer?

I created a data cube in Birt designer with the following data :
Measure : turnover
Dimensions : product, location and time
I have now a three-dimensional OLAP data cube I want to display in a two-dimensional array.
My problem here is I don't know how to assign a constant value for one dimension (for example : product = "computer" or time = "2014")
There are a couple of ways to do this. In terms of performance, the most efficient one would be to add a parameter to the main dataset of the cube and filter here.
If you can't filter in the dataset (for example if other report elements make use of the third datacube dimension), another option is:
Add the dimension you want to slice as a first column dimension of the crosstab, such "Product line" dimension of the screen below.
Add a cross-tab filter and set an expression filtering on this dimension, for example by filtering on the value of a report parameter or on a constant value
Facultative, if you don't want it to be displayed: select the value of the dimension in the crosstab (just click "PRODUCTLINE" label in this example) and turn the visibility property to "hide".

Can a static height be set for subreports in reporting services VS2010?

I've made a report using reporting services (Visual Studio 2010) and I want the subreport to have a predefined height. For example, the subreport will be 2 in. even if the report data only fills up 1 in. of the subreport. Is there a way to force a subreport to have a static height?
I have run into this same issue.
To set a minimum size, add a rectangle with a horizontal and vertical line inside it, and place the subreport inside the rectangle.
One solution I can think of for setting the max size, is to use top(x) in the sub report query, so that any results which exceed the number of lines to be shown are simply not included.
Alternatively, if all results must be included, then add a calculated field to the subreport dataset which counts the rows of data. In the subreport table, for each column set the font size such that all lines of data can be displayed within the given area. (iif(NoOfResults = 2, font = 10, etc))The 'Allow row height to decrease' will also need to be set on the table, so the rows can shrink in accord with the font size.
Hope someone finds this helpful. If you can think of any other solutions, I'd love to hear them.

Resources