无力的HtmlUnit得到的框架内容(HtmlUnit unable to get frame co

2019-10-18 09:53发布

我想设置一个搜索框的值,点击搜索按钮,并解析结果。 问题是,结果显示在另一个框架,我不能够获得其他框架。 编码:

import com.gargoylesoftware.htmlunit.BrowserVersion;
import com.gargoylesoftware.htmlunit.FailingHttpStatusCodeException;
import com.gargoylesoftware.htmlunit.WebClient;
import com.gargoylesoftware.htmlunit.html.HtmlElement;
import com.gargoylesoftware.htmlunit.html.HtmlPage;
import com.gargoylesoftware.htmlunit.html.HtmlPasswordInput;
import com.gargoylesoftware.htmlunit.html.HtmlTextInput;

import java.io.IOException;
import java.net.MalformedURLException;

public class LoginSimulation
{
     public static void main(String args[])
     {
          HtmlPage page = null;
          String url = "http://www.ravmilim.co.il/naerr.asp";

          WebClient webClient = new WebClient(BrowserVersion.FIREFOX_3_6);
          webClient.setThrowExceptionOnScriptError(false);

          try
          {
               page = webClient.getPage( url );

               HtmlTextInput userInput = (HtmlTextInput) page.getElementById("txtUser");
               userInput.setValueAttribute("yacov.schondorf@gmail.com");

               HtmlPasswordInput passwordInput = (HtmlPasswordInput) page.getElementById("txtPass");
               passwordInput.setValueAttribute("5750201");

               HtmlElement theElement2 = (HtmlElement) page.getElementById("submitButton");
               page = theElement2.click();  

              HtmlPage framePage = (HtmlPage)               nextPage.getFrames().get(0).getEnclosedPage(); 
              HtmlTextInput searchBox = (HtmlTextInput)                            framePage.getForms().get(0).getInputsByName("searchBox").get(0);

              //
              // so far so good...
              //
              searchBox.setValueAttribute("word");
              HtmlAnchor anchor = framePage.getHtmlElementById("sl");
              HtmlPage page1 = (HtmlPage) anchor.click(); 
              try { 
                  HtmlPage resultsPage = (HtmlPage) page1.getFrameByName("resault1").getEnclosedPage();// this should have worked!!
              } catch (Exception e) { 
                  //
                  // I get an ElementNotFoundException
                  //
                  e.printStackTrace(); 
              } 

              // 
              // must logout - this site is sensitive to multiple logins
              //
              framePage.getAnchorByHref("logout.asp").click();

              webClient.closeAllWindows();            
          }
          catch ( Exception e )
          {
               e.printStackTrace();
          }
       }    
    }

Answer 1:

您可以尝试使用下面的代码来获取框架:

HtmlPage framePage = (HtmlPage)pageAfterLogin.getFrameByName("your_frame_name").getEnclosedPage();

让你可以得到HTML元素(像你这样的登录页)此页面内帧的源之后。



Answer 2:

你可以尝试这个例子(取自http://htmlunit.sourceforge.net/frame-howto.html )

final WebClient client = new WebClient();
final HtmlPage mainPage = client.getPage("http://htmlunit.sourceforge.net/apidocs/index.html");

为了得到第一帧(左上)的页面,然后点击第六链接:

final HtmlPage packageListPage = (HtmlPage) mainPage.getFrames().get(0).getEnclosedPage();
packageListPage.getAnchors().get(5).click();

要获得名为“packageFrame”框架的页面(左下),并点击第二个链接:

final HtmlPage pakcagePage = (HtmlPage) mainPage.getFrameByName("packageFrame").getEnclosedPage();
pakcagePage.getAnchors().get(1).click();

要获得命名的框架的页面“classFrame”(如右图):

final HtmlPage classPage = (HtmlPage) mainPage.getFrameByName("classFrame").getEnclosedPage();


文章来源: HtmlUnit unable to get frame content