How to properly use xpath & regexp extractor in jmeter? - regex

I have the following text in the HTML response:
<input type="hidden" name="test" value="testValue">
I need to extract the value from the above input tag.
I've tried both regexp and xpath extractor, but neither is working for me:
regexp pattern
input\s*type="hidden"\s*name="test"\s*value="(.+)"\s*>
xpath query
//input[#name="test"]/#value
The above xpath gives an error at the Xpath Assertion Listener .. "No node matched".
I tried a lot and concluded that the xpath works only if I use it as //input[#name].
At the moment I'm trying to add an actual name it gives the error .. "No node matched".
Could anyone please suggest me how to resolve the above issue?

Please take a look at my previous answer :
https://stackoverflow.com/a/11452267/169277
The relevant part for you would be step 3:
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.select.Element;
String html = prev.getResponseDataAsString(); // get response from your sampler
Document doc = Jsoup.parse(html);
Element inputElement = doc.select("input[name=test]").first();
String inputValue = inputElement.attr("value");
vars.put("inputTextValue", inputValue);
Update
So you don't get tangled with the code I've created jMeter post processor called Html Extractor here is the github url :
https://github.com/c0mrade/Html-Extractor

Since you are using XPath Extractor to parse HTML (not XML) response ensure that Use Tidy (tolerant parser) option is CHECKED (in XPath Extractor's control panel).
Your xpath query looks fine, check the option mentioned above and try again.

Related

Extract a value from actionlink html response body - JMeter

I have a response body from one http request and I have to extract all the values and as input to another http request
<a class="action-link"
regionName="region name"
jName="a country"
alertID="179"
onclick="showDetail(this)"> Snapshot- v4 (Active)
</a>
i tried with regex like
created a post processors -> regular expression extractors
Apply to -> Main Sample Only
Field to check -> Body
Name of created variable -> regionNameVariable
Regular Expression -> <a class="action-link" regionName=(.*?)
and passed the value as ${regionNameVariable} in the next request.
But It is not working.
Can someone suggest the correct way of doing this.
Be aware that using regular expressions for parsing HTML is not a very good idea, consider using CSS Selector Extractor instead, the relevant configuration would be:
Name of created variable: regionNameVariable
CSS Selector Expression: a[class=action-link]
Attribute: regionName
Demo:
More information:
CSS Selector Reference
How to Use the CSS/JQuery Extractor in JMeter
Add multiple Regular expression extractor to extract each values
For example to extract the 3rd group in regex of Region Name(You can write better Regx if you have better idea)
Like Wise Add Regex for jName and alertID
(jName)(=)\"(.+)\"
(alertID)(=)\"(.+)\"
Then Pass on the Reference name as user variable in your next http request.as ${regionName} and so on.
If you want to do it in the same Regex post Processor. Please refer How to extract multiple values with a regular expression in Jmeter

Xpath query returns no element found

I'm querying google search for number of searches. the xpath query i'm using is //div[#id='resultStats'] which too my understanding on the pages html:
<div id="resultStats">About 1,660,000,000 results<nobr> (0.65 seconds) </nobr></div>
should return the data within the div.
I've tried with Importhtml(url,xpath) and http://xpather.com/ <- an XML path tester and I get "Imported content is empty" and "no content found" respectively.
I was initially using importhtml and isolated the issue to the XPath using xpather as an XPath tester, so I think I've zoned the issue down a bit. Any help would be appreciated.
try:
=VALUE(REGEXREPLACE(MID(INDEX(IMPORTHTML(
"https://www.google.com/search?q="&A1&"";"table";1);4;2);6;23);"\D+";))

Need a regexr to fetch the multiple data

I need a regrex to get the value in the below format
Code=ABC&cellNo=314&payType=0&transmission=MANUAL&vendorId=ET&toggleSelctd=0&uniqueId=39
This is a sample code is attached in the below screenshot
but however the regrex which i have created fetches data for one value but i need the same in the above format, for example below are the regrex.
name="cellNo" type="hidden" value="(.+?)" value="(.+?)
name="transmission" type="hidden" value="(.+?)"
Thanks
Don't use regular expressions to parse HTML. JMeter offers CSS/JQuery Extractor for fetching data from the HTML pages so you should be able to extract the required values using the following configurations:
For MANUAL:
Reference Name: anything meaningful, i.e. transmission
CSS/JQuery Expression: input[id=transmission]
Attribute: value
For 314
Reference Name: anything meaningful, i.e. CellNo
CSS/JQuery Expression: input[id=CellNo]
Attribute: value
See How to Use the CSS/JQuery Extractor in JMeter article for more information.
I think response markup is a little bit flaky as HTML input cannot have 2 name and 2 value tags, I would recommend using JMeter's HTML Assertion or online HTML Validator to check response data and raise issues on validation errors.

Extract texts until certain patterns on Scrapy

I'm trying to scrape certain contents from a webpage using Scrapy.
The html element looks like below.
'<p>\n 阪急宝塚線\xa0/\xa0石橋駅\xa0徒歩1分\n (<a href="javascript:void(0);" style="cursor:pointer;" onclic
k=\'window.open("http://athome.ekiworld.net/?id=athome&to=asso 302 ワンルーム&to_near_station1=25824&to_near_time1=1&to_near_traffic1=徒歩 1 分");return false;\'>電車ルート案内</a>)\n
</p>'
My goal is to extract only this part "阪急宝塚線\xa0/\xa0石橋駅\xa0徒歩1分\n".
I tried to use .re() with response and I thought ^(.+?<a) would work since it succeeded parsing on https://regex101.com/. But on scrapy shell, it doesn't parse anything (gives me []).
Could someone help me with this?
I use Python3/scrapy1.3.0.
Thanks!
import re
text = '''<p>\n 阪急宝塚線\xa0/\xa0石橋駅\xa0徒歩1分\n (<a href="javascript:void(0);" style="cursor:pointer;" onclic
k=\'window.open("http://athome.ekiworld.net/?id=athome&to=asso 302 ワンルーム&to_near_station1=25824&to_near_time1=1&to_near_traffic1=徒歩 1 分");return false;\'>電車ルート案内</a>)\n
</p>'''
re.search(r'\n.+?\n', text).group()
out:
'\n 阪急宝塚線\xa0/\xa0石橋駅\xa0徒歩1分\n'

Jmeter Regular Expression Extraction

Could someone help me in getting the value "1237857346" from the following using regex or any other way I could get the value "1237857346" in from HTML in JMeter.
<select class="card_account" name="existing__account">
<option value="" disabled="disabled">Card Number</option>
<option value="1237857346" selected="selected">************4567</option>
</select>
Little bit of background. I am using JMeter and trying to extra the value "1237857346" to pass it in the next request.
It is not very good idea to parse HTML using Regular Expressions as it evidenced by the famous Stack Overflow answer
I would suggest switching to XPath Extractor instead. Add the XPath Extractor as a child of HTTP Request sampler which returns that select and configure it as follows:
XML Parsing Options: tick Use Tidy box. It may not be necessary but if your server response is not XML/XHTML compliant you'll get nothing
Reference Name: anything meaningful, i.e. value - it will be the name of the variable holding extracted data
XPath Expression: //select[#class='card_account']/option[#selected='selected']/#value - it will take
select having class = card_account
option with selected = "selected"
value attribute of the above option
and store it to "value" variable. You will be able to refer to it as ${value} where required.
See following material for further reference:
XPath Tutorial
XPath Language Specification
Using the XPath Extractor in JMeter
You can use the following regex:
<option[^<]*>Card Number</option>\s*<option[^<]*?value="(\d+)"
The value will be in group 1 ($1$), which is exactly what you need.
See demo
In case the are always 12 asterisks (that can be matched with \*{12}) in the <option> node value, you'd can use:
<option[^<]*value="(\d+)"[^<]*>\*{12}\d+</option>
See another demo.