Wie mache ich eine Regex-Suche in Nokogiri nach Text, der einem bestimmten Anfang entspricht?

Gegeben:Wie mache ich eine Regex-Suche in Nokogiri nach Text, der einem bestimmten Anfang entspricht?

require 'rubygems' 
require 'nokogiri' 
value = Nokogiri::HTML.parse(<<-HTML_END) 
"<html> 
<body> 
    <p id='para-1'>A</p> 
    <div class='block' id='X1'> 
    <h1>Foo</h1> 
    <p id='para-2'>B</p> 
    </div> 
    <p id='para-3'>C</p> 
    <h2>Bar</h2> 
    <p id='para-4'>D</p> 
    <p id='para-5'>E</p> 
    <div class='block' id='X2'> 
    <p id='para-6'>F</p> 
    </div> 
</body> 
</html>" 
HTML_END

ich so etwas tun will, was ich in Hpricot tun kann:

divs = value.search('//div[@id^="para-"]')

Wie für Elemente in XPath Stil ein Muster Such ich tun?
Wo finde ich die Dokumentation, die mir hilft? Ich habe das in den Rdocs nicht gesehen.

Quelle

2009-10-12 bcolfer

PSA: Für diejenigen, komplexere regex versucht, dies ist wahrscheinlich, was Sie suchen: http://stackoverflow.com/questions/649963/ nokogiri-suche-nach-div-using-xpath – DreadPirateShawn

Verwenden Sie die XPath-Funktion starts-with:

value.xpath('//p[starts-with(@id, "para-")]').each { |x| puts x['id'] }

Quelle

2009-10-12 18:28:38

+29

Wow, Aaron selbst hat es gerade beantwortet! – khelll

@khelll was ist so cool in Aaron? –

Autor von Nokogiri und RoR-Kernteam-Mitglied. – khelll

Und einige docs Sie suchen:

Nokogiri: http://nokogiri.org/
XPath: http://www.w3.org/TR/xpath20/
CSS3-Selektoren: http://www.w3.org/TR/selectors/

Quelle

2009-10-12 22:44:20

divs = value.css('div[id^="para-"]')

Quelle

2010-06-25 22:48:01

das ist ein Lebensretter – Onichan

Nokogiri::XML::Node.send(:define_method, 'xpath_regex') { |*args| 
    xpath = args[0] 
    rgxp = /\/([a-z]+)\[@([a-z\-]+)~=\/(.*?)\/\]/ 
    xpath.gsub!(rgxp) { |s| m = s.match(rgxp); "/#{m[1]}[regex(.,'#{m[2]}','#{m[3]}')]" } 
    self.xpath(xpath, Class.new { 
    def regex node_set, attr, regex 
     node_set.find_all { |node| node[attr] =~ /#{regex}/ } 
    end 
    }.new) 
}

Verbrauch:

divs = Nokogiri::HTML(page.root.to_html). 
    xpath_regex("//div[@class~=/axtarget$/]//div[@class~=/^carbo/]")

Quelle

2016-01-08 13:31:41 karwan

Antwort

Verwandte Themen