54 KiB
Puppeteer API v0.1.0
Table of Contents
- Puppeteer
- class: Browser
- class: Page
- event: 'console'
- event: 'dialog'
- event: 'frameattached'
- event: 'framedetached'
- event: 'framenavigated'
- event: 'load'
- event: 'pageerror'
- event: 'request'
- event: 'requestfailed'
- event: 'requestfinished'
- event: 'response'
- page.addBinding(name, puppeteerFunction)
- page.addScriptTag(url)
- page.click(selector[, options])
- page.close()
- page.emulate(options)
- page.evaluate(pageFunction, ...args)
- page.evaluateOnNewDocument(pageFunction, ...args)
- page.focus(selector)
- page.frames()
- page.goBack(options)
- page.goForward(options)
- page.goto(url, options)
- page.hover(selector)
- page.injectFile(filePath)
- page.keyboard
- page.mainFrame()
- page.mouse
- page.pdf(options)
- page.plainText()
- page.press(key[, options])
- page.reload(options)
- page.screenshot([options])
- page.setContent(html)
- page.setExtraHTTPHeaders(headers)
- page.setRequestInterceptionEnabled(value)
- page.setUserAgent(userAgent)
- page.setViewport(viewport)
- page.title()
- page.tracing
- page.type(text, options)
- page.uploadFile(selector, ...filePaths)
- page.url()
- page.viewport()
- page.waitFor(selectorOrFunctionOrTimeout[, options])
- page.waitForFunction(pageFunction[, options, ...args])
- page.waitForNavigation(options)
- page.waitForSelector(selector[, options])
- class: Keyboard
- class: Mouse
- class: Tracing
- class: Dialog
- class: Frame
- frame.addScriptTag(url)
- frame.childFrames()
- frame.click(selector[, options])
- frame.evaluate(pageFunction, ...args)
- frame.focus(selector)
- frame.hover(selector)
- frame.injectFile(filePath)
- frame.isDetached()
- frame.name()
- frame.parentFrame()
- frame.title()
- frame.uploadFile(selector, ...filePaths)
- frame.url()
- frame.waitFor(selectorOrFunctionOrTimeout[, options])
- frame.waitForFunction(pageFunction[, options, ...args])
- frame.waitForSelector(selector[, options])
- class: Request
- class: Response
Puppeteer
Puppeteer is a Node library which provides a high-level API to control Chromium over the DevTools Protocol.
Puppeteer provides a top-level require which has a Browser class. The following is a typical example of using a Browser class to drive automation:
const {Browser} = require('puppeteer');
const browser = new Browser();
browser.newPage().then(async page => {
await page.goto('https://google.com');
// other actions...
browser.close();
});
class: Browser
Browser manages a browser instance, creating it with a predefined settings, opening and closing pages. Instantiating Browser class does not necessarily result in launching browser; the instance will be launched when the need will arise.
A typical scenario of using Browser is opening a new page and navigating it to a desired URL:
const {Browser} = require('puppeteer');
const browser = new Browser();
browser.newPage().then(async page => {
await page.goto('https://example.com');
browser.close();
});
new Browser([options])
options
<Object> Set of configurable options to set on the browser. Can have the following fields:ignoreHTTPSErrors
<boolean> Whether to ignore HTTPS errors during navigation. Defaults tofalse
.headless
<boolean> Whether to run chromium in headless mode. Defaults totrue
.executablePath
<string> Path to a chromium executable to run instead of bundled chromium. IfexecutablePath
is a relative path, then it is resolved relative to current working directory.slowMo
<number> Slows down Puppeteer operations by the specified amount of milliseconds. Useful so that you can see what is going on.args
<Array<string>> Additional arguments to pass to the chromium instance. List of chromium flags can be found here.
browser.close()
Closes browser with all the pages (if any were opened). The browser object itself is considered to be disposed and could not be used anymore.
browser.newPage()
browser.stderr
A Readable Stream that represents the browser process's stderr.
For example, stderr
could be piped into process.stderr
:
const {Browser} = require('puppeteer');
const browser = new Browser();
browser.stderr.pipe(process.stderr);
browser.version().then(version => {
console.log(version);
browser.close();
});
browser.stdout
A Readable Stream that represents the browser process's stdout.
For example, stdout
could be piped into process.stdout
:
const {Browser} = require('puppeteer');
const browser = new Browser();
browser.stdout.pipe(process.stdout);
browser.version().then(version => {
console.log(version);
browser.close();
});
browser.version()
- returns: <Promise<string>> String describing browser version. For headless chromium, this is similar to
HeadlessChrome/61.0.3153.0
. For non-headless, this isChrome/61.0.3153.0
.
Note
the format of browser.version() is not fixed and might change with future releases of the library.
class: Page
Page provides methods to interact with browser page. Page could be thought about as a browser tab, so one Browser instance might have multiple Page instances.
An example of creating a page, navigating it to a URL and saving screenshot as screenshot.png
:
const {Browser} = require('puppeteer');
const browser = new Browser();
browser.newPage().then(async page =>
await page.goto('https://example.com');
await page.screenshot({path: 'screenshot.png'});
browser.close();
});
event: 'console'
- <string>
Emitted when a page calls one of console API methods, e.g. console.log
or console.dir
.
If multiple arguments are passed over to the console API call, these arguments are dispatched in an event.
An example of handling console
event:
page.on('console', (...args) => {
for (let i =0; i < args.length; ++i)
console.log(`${i}: ${args[i]}`);
});
page.evaluate(() => console.log(5, 'hello', {foo: 'bar'}));
event: 'dialog'
- <Dialog>
Emitted when a JavaScript dialog, such as alert
, prompt
, confirm
or beforeunload
, gets opened on the page. Puppeteer can take action to the dialog via dialog's accept or dismiss methods.
event: 'frameattached'
- <Frame>
Emitted when a frame gets attached.
event: 'framedetached'
- <Frame>
Emitted when a frame gets detached.
event: 'framenavigated'
- <Frame>
Emitted when a frame committed navigation.
event: 'load'
Emitted when a page's load
event was dispatched.
event: 'pageerror'
- <string>
Emitted when an unhandled exception happens on the page. The only argument of the event holds the exception message.
event: 'request'
- <Request>
Emitted when a page issues a request. The request object is a read-only object.
In order to intercept and mutate requests, see page.setRequestInterceptionEnabled
.
event: 'requestfailed'
- <Request>
Emitted when a request is failed.
event: 'requestfinished'
- <Request>
Emitted when a request is successfully finished.
event: 'response'
- <Response>
Emitted when a response is received.
page.addBinding(name, puppeteerFunction)
name
<string> Name of the binding on window objectpuppeteerFunction
<function> Callback function which will be called in puppeteer's context.- returns: <Promise> Promise which resolves with the result of
puppeteerFunction
.
The method adds a function called name
on window
object.
When called, the function executes puppeteerFunction
function in puppeteer context and returns
a promise that resolves with the puppeteer's result.
If the puppeteerFunction
returns a promise, it would be awaited.
Note
All the bindings installed via the
page.addBinding
survive navigations.
An example of adding window.md5
binding to the page:
const {Browser} = require('puppeteer');
const browser = new Browser();
const crypto = require('crypto');
browser.newPage().then(async page => {
page.on('console', console.log);
await page.setInPageCallback('md5', text => crypto.createHash('md5').update(text).digest('hex'));
await page.evaluate(async () => {
// use window.md5 to compute hashes
let myString = 'PUPPETEER';
let myHash = await window.md5(myString);
console.log(`md5 of ${myString} is ${myHash}`);
});
browser.close();
});
An example of adding window.readfile
binding to the page:
const {Browser} = require('puppeteer');
const browser = new Browser();
const fs = require('fs');
browser.newPage().then(async page => {
page.on('console', console.log);
await page.setInPageCallback('readfile', async filePath => {
return new Promise((resolve, reject) => {
fs.readFile(filePath, 'utf8', (err, text) => {
if (err)
reject(err);
else
resolve(text);
});
});
});
await page.evaluate(async () => {
// use window.readfile to read contents of a file
let content = await window.readfile('/etc/hosts');
console.log(content);
});
browser.close();
});
page.addScriptTag(url)
url
<string> Url of a script to be added- returns: <Promise> Promise which resolves as the script gets added and loads.
Adds a <script>
tag to the frame with the desired url. Alternatively, JavaScript could be injected to the frame via frame.injectFile
method.
Shortcut for page.mainFrame().addScriptTag(url).
page.click(selector[, options])
selector
<string> A query selector to search for element to click. If there are multiple elements satisfying the selector, the first will be clicked.options
<Object>- returns: <Promise> Promise which resolves when the element matching
selector
is successfully clicked. Promise gets rejected if there's no element matchingselector
.
Shortcut for page.mainFrame().click(selector[, options]).
page.close()
- returns: <Promise> Returns promise which resolves when page gets closed.
page.emulate(options)
options
<Object>viewport
<Object> viewport as described inpage.setViewport
method.userAgent
<string> user agent string
- returns: <Promise> Promise which resolves when emulation is performed.
Emulates given device metrics and user agent. This method is a shortcut for calling two methods:
To aid emulation, puppeteer provides a list of device descriptors which could be obtained via the require('puppeteer/DeviceDescriptors')
command.
Below is an example of emulating iPhone 6 in puppeteer:
const {Browser} = require('puppeteer');
const devices = require('puppeteer/DeviceDescriptors');
const iPhone = devices['iPhone 6'];
const browser = new Browser();
browser.newPage().then(async page => {
await page.emulate(iPhone);
await page.goto('https://google.com');
// other actions...
browser.close();
});
List of all available devices is available in the source code: DeviceDescriptors.js.
page.evaluate(pageFunction, ...args)
pageFunction
<function|string> Function to be evaluated in browser context...args
<...string> Arguments to pass topageFunction
- returns: <Promise<Object>> Promise which resolves to function return value
If the function, passed to the page.evaluate
, returns a Promise, then page.evaluate
would wait for the promise to resolve and return it's value.
const {Browser} = require('puppeteer');
const browser = new Browser();
browser.newPage().then(async page =>
const result = await page.evaluate(() => {
return Promise.resolve(8 * 7);
});
console.log(result); // prints "56"
browser.close();
});
A string can also be passed in instead of a function.
console.log(await page.evaluate('1 + 2')); // prints "3"
Shortcut for page.mainFrame().evaluate(pageFunction, ...args).
page.evaluateOnNewDocument(pageFunction, ...args)
pageFunction
<function|string> Function to be evaluated in browser context...args
<...string> Arguments to pass topageFunction
- returns: <Promise<Object>> Promise which resolves to function
Adds a function which would be invoked in one of the following scenarios:
- whenever the page gets navigated
- whenever the child frame gets attached or navigated. In this case, the function gets invoked in the context of the newly attached frame
The function is invoked after the document was created but before any of its scripts were run. This is useful to amend JavaScript environment, e.g. to seed Math.random
page.focus(selector)
selector
<string> A query selector of element to focus. If there are multiple elements satisfying the selector, the first will be focused.- returns: <Promise> Promise which resolves when the element matching
selector
is successfully focused. Promise gets rejected if there's no element matchingselector
.
Shortcut for page.mainFrame().focus(selector).
page.frames()
page.goBack(options)
options
<Object> Navigation parameters which might have the following properties:timeout
<number> Maximum navigation time in milliseconds, defaults to 30 seconds.waitUntil
<string> When to consider navigation succeeded, defaults toload
. Could be either:load
- consider navigation to be finished when theload
event is fired.networkidle
- consider navigation to be finished when the network activity stays "idle" for at leastnetworkIdleTimeout
ms.
networkIdleInflight
<number> Maximum amount of inflight requests which are considered "idle". Takes effect only withwaitUntil: 'networkidle'
parameter.networkIdleTimeout
<number> A timeout to wait before completing navigation. Takes effect only withwaitUntil: 'networkidle'
parameter.
- returns: <Promise<Response>> Promise which resolves to the main resource response. In case of multiple redirects, the navigation will resolve with the response of the last redirect. If can not go back, resolves to null.
Navigate to the previous page in history.
page.goForward(options)
options
<Object> Navigation parameters which might have the following properties:timeout
<number> Maximum navigation time in milliseconds, defaults to 30 seconds.waitUntil
<string> When to consider navigation succeeded, defaults toload
. Could be either:load
- consider navigation to be finished when theload
event is fired.networkidle
- consider navigation to be finished when the network activity stays "idle" for at leastnetworkIdleTimeout
ms.
networkIdleInflight
<number> Maximum amount of inflight requests which are considered "idle". Takes effect only withwaitUntil: 'networkidle'
parameter.networkIdleTimeout
<number> A timeout to wait before completing navigation. Takes effect only withwaitUntil: 'networkidle'
parameter.
- returns: <Promise<Response>> Promise which resolves to the main resource response. In case of multiple redirects, the navigation will resolve with the response of the last redirect. If can not go back, resolves to null.
Navigate to the next page in history.
page.goto(url, options)
url
<string> URL to navigate page tooptions
<Object> Navigation parameters which might have the following properties:timeout
<number> Maximum navigation time in milliseconds, defaults to 30 seconds.waitUntil
<string> When to consider navigation succeeded, defaults toload
. Could be either:load
- consider navigation to be finished when theload
event is fired.networkidle
- consider navigation to be finished when the network activity stays "idle" for at leastnetworkIdleTimeout
ms.
networkIdleInflight
<number> Maximum amount of inflight requests which are considered "idle". Takes effect only withwaitUntil: 'networkidle'
parameter.networkIdleTimeout
<number> A timeout to wait before completing navigation. Takes effect only withwaitUntil: 'networkidle'
parameter. Defaults to 1000 ms.
- returns: <Promise<Response>> Promise which resolves to the main resource response. In case of multiple redirects, the navigation will resolve with the response of the last redirect.
The page.goto
will throw an error if:
- there's an SSL error (e.g. in case of self-signed certificates).
- target URL is invalid.
- the
timeout
is exceeded during navigation. - the main resource failed to load.
Note
page.goto
either throw or return a main resource response. The only exception is navigation toabout:blank
, which would succeed and returnnull
.
page.hover(selector)
selector
<string> A query selector to search for element to hover. If there are multiple elements satisfying the selector, the first will be hovered.- returns: <Promise> Promise which resolves when the element matching
selector
is successfully hovered. Promise gets rejected if there's no element matchingselector
.
Shortcut for page.mainFrame().hover(selector).
page.injectFile(filePath)
filePath
<string> Path to the JavaScript file to be injected into frame. IffilePath
is a relative path, then it is resolved relative to current working directory.- returns: <Promise> Promise which resolves when file gets successfully evaluated in frame.
Shortcut for page.mainFrame().injectFile(filePath).
page.keyboard
- returns: <Keyboard>
page.mainFrame()
- returns: <Frame> returns page's main frame.
Page is guaranteed to have a main frame which persists during navigations.
page.mouse
- returns: <Mouse>
page.pdf(options)
options
<Object> Options object which might have the following properties:path
<string> The file path to save the PDF to. Ifpath
is a relative path, then it is resolved relative to current working directory.scale
<number> Scale of the webpage rendering. Defaults to1
.displayHeaderFooter
<boolean> Display header and footer. Defaults tofalse
.printBackground
<boolean> Print background graphics. Defaults tofalse
.landscape
<boolean> Paper orientation. Defaults tofalse
.pageRanges
<string> Paper ranges to print, e.g., '1-5, 8, 11-13'. Defaults to the empty string, which means print all pages.format
<string> Paper format. If set, takes priority overwidth
orheight
options. Defaults to 'Letter'.width
<string> Paper width, accepts values labeled with units.height
<string> Paper height, accepts values labeled with units.margin
<Object> Paper margins, defaults to none.
- returns: <Promise<Buffer>> Promise which resolves with PDF buffer.
The width
, height
, and margin
options accept values labeled with units. Unlabeled values are treated as pixels.
A few examples:
page.pdf({width: 100})
- prints with width set to 100 pixelspage.pdf({width: '100px'})
- prints with width set to 100 pixelspage.pdf({width: '10cm'})
- prints with width set to 10 centimeters.
All possible units are:
px
- pixelin
- inchcm
- centimetermm
- millimeter
The format
options are:
Letter
: 8.5in x 11inLegal
: 8.5in x 14inTabloid
: 11in x 17inLedger
: 17in x 11inA0
: 33.1in x 46.8inA1
: 23.4in x 33.1inA2
: 16.5in x 23.4inA3
: 11.7in x 16.5inA4
: 8.27in x 11.7inA5
: 5.83in x 8.27in
Note
Generating a pdf is currently only supported in Chrome headless.
page.plainText()
page.press(key[, options])
key
<string> Name of key to press, such asArrowLeft
. See KeyboardEvent.keyoptions
<Object>- returns: <Promise>
Shortcut for keyboard.down
and keyboard.up
.
page.reload(options)
options
<Object> Navigation parameters which might have the following properties:timeout
<number> Maximum navigation time in milliseconds, defaults to 30 seconds.waitUntil
<string> When to consider navigation succeeded, defaults toload
. Could be either:load
- consider navigation to be finished when theload
event is fired.networkidle
- consider navigation to be finished when the network activity stays "idle" for at leastnetworkIdleTimeout
ms.
networkIdleInflight
<number> Maximum amount of inflight requests which are considered "idle". Takes effect only withwaitUntil: 'networkidle'
parameter.networkIdleTimeout
<number> A timeout to wait before completing navigation. Takes effect only withwaitUntil: 'networkidle'
parameter.
- returns: <Promise<Response>> Promise which resolves to the main resource response. In case of multiple redirects, the navigation will resolve with the response of the last redirect.
page.screenshot([options])
options
<Object> Options object which might have the following properties:path
<string> The file path to save the image to. The screenshot type will be inferred from file extension. Ifpath
is a relative path, then it is resolved relative to current working directory.type
<string> Specify screenshot type, could be eitherjpeg
orpng
. Defaults to 'png'.quality
<number> The quality of the image, between 0-100. Not applicable topng
images.fullPage
<boolean> When true, takes a screenshot of the full scrollable page. Defaults tofalse
.clip
<Object> An object which specifies clipping region of the page. Should have the following fields:
- returns: <Promise<Buffer>> Promise which resolves to buffer with captured screenshot
page.setContent(html)
html
<string> HTML markup to assign to the page.- returns: <Promise> Promise which resolves when the content is successfully assigned.
page.setExtraHTTPHeaders(headers)
headers
<Map> A map of additional http headers to be sent with every request.- returns: <Promise> Promise which resolves when additional headers are installed
The extra HTTP headers will be sent with every request the page initiates.
Note
page.setExtraHTTPHeaders does not guarantee the order of headers in the outgoing requests.
page.setRequestInterceptionEnabled(value)
value
<boolean> Whether to enable request interception.- returns: <Promise> Promise which resolves when request interception change is applied.
Activating request interception enables request.abort
and request.continue
.
An example of a naïve request interceptor which aborts all image requests:
const {Browser} = require('puppeteer');
const browser = new Browser();
browser.newPage().then(async page =>
await page.setRequestInterceptionEnabled(true);
page.on('request', request => {
if (interceptedRequest.url.endsWith('.png') || interceptedRequest.url.endsWith('.jpg'))
interceptedRequest.abort();
else
interceptedRequest.continue();
});
await page.goto('https://example.com');
browser.close();
});
page.setUserAgent(userAgent)
userAgent
<string> Specific user agent to use in this page- returns: <Promise> Promise which resolves when the user agent is set.
page.setViewport(viewport)
viewport
<Object> An object with two fields:width
<number> Specify page's width in pixels.height
<number> Specify page's height in pixels.deviceScaleFactor
<number> Specify device scale factor (could be though of as dpr). Defaults to1
.isMobile
<boolean> Whether themeta viewport
tag is taken into account. Defaults tofalse
.hasTouch
<boolean> Specify if viewport supports touch events. Defaults tofalse
isLandscape
<boolean> Specify if viewport is in the landscape mode. Defaults tofalse
.
- returns: <Promise> Promise which resolves when the dimensions are updated.
Note
in certain cases, setting viewport will reload the page so that the
isMobile
orhasTouch
options will be able to interfere in project loading.
The page's viewport size defines page's dimensions, observable from page via window.innerWidth / window.innerHeight
. The viewport size defines a size of page
screenshot (unless a fullPage
option is given).
In case of multiple pages in one browser, each page can have its own viewport size.
page.title()
Shortcut for page.mainFrame().title().
page.tracing
- returns: <Tracing>
page.type(text, options)
text
<string> A text to type into a focused element.options
<Object>delay
<number> Time to wait between key presses in milliseconds. Defaults to 0.
- returns: <Promise> Promise which resolves when the text has been successfully typed.
Sends a keydown
, keypress
/input
, and keyup
event for each character in the text.
To press a special key, use page.press
.
page.type('Hello'); // Types instantly
page.type('World', {delay: 100}); // Types slower, like a user
page.uploadFile(selector, ...filePaths)
selector
<string> A query selector to a file input...filePaths
<string> Sets the value of the file input these paths. If some of thefilePaths
are relative paths, then they are resolved relative to current working directory.- returns: <Promise> Promise which resolves when the value is set.
Shortcut for page.mainFrame().uploadFile(selector, ...filePaths).
page.url()
- returns: <string>
This is a shortcut for page.mainFrame().url()
page.viewport()
- returns: <Object> An object with the save fields as described in page.setViewport
page.waitFor(selectorOrFunctionOrTimeout[, options])
selectorOrFunctionOrTimeout
<string|number|function> A selector, predicate or timeout to wait foroptions
<Object> Optional waiting parameters- returns: <Promise>
This method behaves differently with respect to the type of the first parameter:
- if
selectorOrFunctionOrTimeout
is astring
, than the first argument is treated as a selector to wait for and the method is a shortcut for frame.waitForSelector - if
selectorOrFunctionOrTimeout
is afunction
, than the first argument is treated as a predicate to wait for and the method is a shortcut for frame.waitForFunction(). - if
selectorOrFunctionOrTimeout
is anumber
, than the first argument is treated as a timeout in milliseconds and the method returns a promise which resolves after the timeout - otherwise, an exception is thrown
Shortcut for page.mainFrame().waitFor(selectorOrFunctionOrTimeout[, options]).
page.waitForFunction(pageFunction[, options, ...args])
pageFunction
<function|string> Function to be evaluated in browser contextoptions
<Object> Optional waiting parameterspolling
<string|number> An interval at which thepageFunction
is executed, defaults toraf
. Ifpolling
is a number, then it is treated as an interval in milliseconds at which the function would be executed. Ifpolling
is a string, then it could be one of the following values:raf
- to constantly executepageFunction
inrequestAnimationFrame
callback. This is the tightest polling mode which is suitable to observe styling changes.mutation
- to executepageFunction
on every DOM mutation.
timeout
<number> maximum time to wait for in milliseconds. Defaults to30000
(30 seconds).
...args
<...Object> Arguments to pass topageFunction
- returns: <Promise> Promise which resolves when element specified by selector string is added to DOM.
The waitForFunction
could be used to observe viewport size change:
const {Browser} = require('.');
const browser = new Browser();
browser.newPage().then(async page => {
const watchDog = page.waitForFunction('window.innerWidth < 100');
page.setViewport({width: 50, height: 50});
await watchDog;
browser.close();
});
Shortcut for page.mainFrame().waitForFunction(pageFunction[, options, ...args]).
page.waitForNavigation(options)
options
<Object> Navigation parameters which might have the following properties:timeout
<number> Maximum navigation time in milliseconds, defaults to 30 seconds.waitUntil
<string> When to consider navigation succeeded, defaults toload
. Could be either:load
- consider navigation to be finished when theload
event is fired.networkidle
- consider navigation to be finished when the network activity stays "idle" for at leastnetworkIdleTimeout
ms.
networkIdleInflight
<number> Maximum amount of inflight requests which are considered "idle". Takes effect only withwaitUntil: 'networkidle'
parameter.networkIdleTimeout
<number> A timeout to wait before completing navigation. Takes effect only withwaitUntil: 'networkidle'
parameter.
- returns: <Promise<Response>> Promise which resolves to the main resource response. In case of multiple redirects, the navigation will resolve with the response of the last redirect.
page.waitForSelector(selector[, options])
selector
<string> CSS selector of awaited element,options
<Object> Optional waiting parameters- returns: <Promise> Promise which resolves when element specified by selector string is added to DOM.
Wait for the selector
to appear in page. If at the moment of calling
the method the selector
already exists, the method will return
immediately. If the selector doesn't appear after the timeout
milliseconds of waiting, the function will throw.
This method works across navigations:
const {Browser} = new require('puppeteer');
const browser = new Browser();
browser.newPage().then(async page => {
let currentURL;
page.waitForSelector('img').then(() => console.log('First URL with image: ' + currentURL));
for (currentURL of ['https://example.com', 'https://google.com', 'https://bbc.com'])
await page.goto(currentURL);
browser.close();
});
Shortcut for page.mainFrame().waitForSelector(selector[, options]).
class: Keyboard
Keyboard provides an api for managing a virtual keyboard. The high level api is page.type
, which takes raw characters and generates proper keydown, keypress/input, and keyup events on your page.
For finer control, you can use keyboard.down
, keyboard.up
, and keyboard.sendCharacter
to manually fire events as if they were generated from a real keyboard.
An example of holding down Shift
in order to select and delete some text:
page.type('Hello World!');
page.press('ArrowLeft');
page.keyboard.down('Shift');
for (let i = 0; i < ' World'.length; i++)
page.press('ArrowLeft');
page.keyboard.up('Shift');
page.press('Backspace');
// Result text will end up saying 'Hello!'
keyboard.down(key[, options])
key
<string> Name of key to press, such asArrowLeft
. See KeyboardEvent.keyoptions
<Object>text
<string> If specified, generates an input event with this text.
- returns: <Promise>
Dispatches a keydown
event.
This will not send input events unless text
is specified.
If key
is a modifier key, Shift
, Meta
, Control
, or Alt
, subsequent key presses will be sent with that modifier active. To release the modifier key, use keyboard.up
.
After the key is pressed once, subsequent calls to keyboard.down
will have repeat set to true. To release the key, use keyboard.up
.
keyboard.sendCharacter(char)
Dispatches a keypress
and input
event. This does not send a keydown
or keyup
event.
page.keyboard.sendCharacter('嗨');
keyboard.up(key)
key
<string> Name of key to release, such asArrowLeft
. See KeyboardEvent.key- returns: <Promise>
Dispatches a keyup
event.
class: Mouse
mouse.click(x, y, [options])
Shortcut for mouse.move
, mouse.down
and mouse.up
.
mouse.down([options])
Dispatches a mousedown
event.
mouse.move(x, y)
Dispatches a mousemove
event.
mouse.up([options])
Dispatches a mouseup
event.
class: Tracing
You can use tracing.start
and tracing.stop
to create a trace file which can be opened in Chrome DevTools or timeline viewer.
await page.tracing.start({path: 'trace.json'});
await page.goto('https://www.google.com');
await page.tracing.stop();
tracing.start(options)
Only one trace can be active at a time per browser.
tracing.stop()
- returns: <Promise>
class: Dialog
Dialog objects are dispatched by page via the 'dialog' event.
An example of using Dialog
class:
const {Browser} = require('puppeteer');
const browser = new Browser({headless: false});
browser.newPage().then(async page => {
page.on('dialog', dialog => {
console.log(dialog.message());
dialog.dismiss();
browser.close();
});
page.evaluate(() => alert('1'));
});
dialog.accept([promptText])
promptText
<string> A text to enter in prompt. Does not cause any effects if the dialog'stype
is not prompt.- returns: <Promise> Promise which resolves when the dialog has being accepted.
dialog.defaultValue()
- returns: <string> If dialog is prompt, returns default prompt value. Otherwise, returns empty string.
dialog.dismiss()
- returns: <Promise> Promise which resolves when the dialog has being dismissed.
dialog.message()
- returns: <string> A message displayed in the dialog.
dialog.type
- <string>
Dialog's type, could be one of the alert
, beforeunload
, confirm
and prompt
.
class: Frame
At every point of time, page exposes its current frame tree via the page.mainFrame() and frame.childFrames() methods.
Frame object's lifecycle is controlled by three events, dispatched on the page object:
- 'frameattached' - fired when the frame gets attached to the page. Frame could be attached to the page only once.
- 'framenavigated' - fired when the frame commits navigation to a different URL.
- 'framedetached' - fired when the frame gets detached from the page. Frame could be detached from the page only once.
An example of dumping frame tree:
const {Browser} = new require('.');
const browser = new Browser({headless: true});
browser.newPage().then(async page => {
await page.goto('https://www.google.com/chrome/browser/canary.html');
dumpFrameTree(page.mainFrame(), '');
browser.close();
function dumpFrameTree(frame, indent) {
console.log(indent + frame.url());
for (let child of frame.childFrames())
dumpFrameTree(child, indent + ' ');
}
});
frame.addScriptTag(url)
url
<string> Url of a script to be added- returns: <Promise> Promise which resolves as the script gets added and loads.
Adds a <script>
tag to the frame with the desired url. Alternatively, JavaScript could be injected to the frame via frame.injectFile
method.
frame.childFrames()
frame.click(selector[, options])
selector
<string> A query selector to search for element to click. If there are multiple elements satisfying the selector, the first will be clicked.options
<Object>- returns: <Promise> Promise which resolves when the element matching
selector
is successfully clicked. Promise gets rejected if there's no element matchingselector
.
frame.evaluate(pageFunction, ...args)
pageFunction
<function|string> Function to be evaluated in browser context...args
<...string> Arguments to pass topageFunction
- returns: <Promise<Object>> Promise which resolves to function return value
If the function, passed to the page.evaluate
, returns a Promise, then page.evaluate
would wait for the promise to resolve and return it's value.
const {Browser} = require('puppeteer');
const browser = new Browser();
browser.newPage().then(async page =>
const result = await page.evaluate(() => {
return Promise.resolve(8 * 7);
});
console.log(result); // prints "56"
browser.close();
});
A string can also be passed in instead of a function.
console.log(await page.evaluate('1 + 2')); // prints "3"
frame.focus(selector)
selector
<string> A query selector of element to focus. If there are multiple elements satisfying the selector, the first will be focused.- returns: <Promise> Promise which resolves when the element matching
selector
is successfully focused. Promise gets rejected if there's no element matchingselector
.
frame.hover(selector)
selector
<string> A query selector to search for element to hover. If there are multiple elements satisfying the selector, the first will be hovered.- returns: <Promise> Promise which resolves when the element matching
selector
is successfully hovered. Promise gets rejected if there's no element matchingselector
.
frame.injectFile(filePath)
filePath
<string> Path to the JavaScript file to be injected into frame. IffilePath
is a relative path, then it is resolved relative to current working directory.- returns: <Promise> Promise which resolves when file gets successfully evaluated in frame.
frame.isDetached()
- returns: <boolean>
Returns true
if the frame has being detached, or false
otherwise.
frame.name()
- returns: <string>
Returns frame's name attribute as specified in the tag.
If the name is empty, returns the id attribute instead.
Note
This value is calculated once when the frame is created, and will not update if the attribute is changed later.
frame.parentFrame()
- returns: <Frame> Returns parent frame, if any. Detached frames and main frames return
null
.
frame.title()
frame.uploadFile(selector, ...filePaths)
selector
<string> A query selector to a file input...filePaths
<string> Sets the value of the file input these paths. If some of thefilePaths
are relative paths, then they are resolved relative to current working directory.- returns: <Promise> Promise which resolves when the value is set.
frame.url()
- returns: <string>
Returns frame's url.
frame.waitFor(selectorOrFunctionOrTimeout[, options])
selectorOrFunctionOrTimeout
<string|number|function> A selector, predicate or timeout to wait foroptions
<Object> Optional waiting parameters- returns: <Promise>
This method behaves differently with respect to the type of the first parameter:
- if
selectorOrFunctionOrTimeout
is astring
, than the first argument is treated as a selector to wait for and the method is a shortcut for frame.waitForSelector - if
selectorOrFunctionOrTimeout
is afunction
, than the first argument is treated as a predicate to wait for and the method is a shortcut for frame.waitForFunction(). - if
selectorOrFunctionOrTimeout
is anumber
, than the first argument is treated as a timeout in milliseconds and the method returns a promise which resolves after the timeout - otherwise, an exception is thrown
frame.waitForFunction(pageFunction[, options, ...args])
pageFunction
<function|string> Function to be evaluated in browser contextoptions
<Object> Optional waiting parameterspolling
<string|number> An interval at which thepageFunction
is executed, defaults toraf
. Ifpolling
is a number, then it is treated as an interval in milliseconds at which the function would be executed. Ifpolling
is a string, then it could be one of the following values:raf
- to constantly executepageFunction
inrequestAnimationFrame
callback. This is the tightest polling mode which is suitable to observe styling changes.mutation
- to executepageFunction
on every DOM mutation.
timeout
<number> maximum time to wait for in milliseconds. Defaults to30000
(30 seconds).
...args
<...Object> Arguments to pass topageFunction
- returns: <Promise> Promise which resolves when element specified by selector string is added to DOM.
The waitForFunction
could be used to observe viewport size change:
const {Browser} = require('.');
const browser = new Browser();
browser.newPage().then(async page => {
const watchDog = page.waitForFunction('window.innerWidth < 100');
page.setViewport({width: 50, height: 50});
await watchDog;
browser.close();
});
frame.waitForSelector(selector[, options])
selector
<string> CSS selector of awaited element,options
<Object> Optional waiting parameters- returns: <Promise> Promise which resolves when element specified by selector string is added to DOM.
Wait for the selector
to appear in page. If at the moment of calling
the method the selector
already exists, the method will return
immediately. If the selector doesn't appear after the timeout
milliseconds of waiting, the function will throw.
This method works across navigations:
const {Browser} = new require('puppeteer');
const browser = new Browser();
browser.newPage().then(async page => {
let currentURL;
page.waitForSelector('img').then(() => console.log('First URL with image: ' + currentURL));
for (currentURL of ['https://example.com', 'https://google.com', 'https://bbc.com'])
await page.goto(currentURL);
browser.close();
});
class: Request
Whenever the page sends a request, the following events are emitted by puppeteer's page:
- 'request' emitted when the request is issued by the page.
- 'response' emitted when/if the response is received for the request.
- 'requestfinished' emitted when the response body is downloaded and the request is complete.
If request fails at some point, then instead of 'requestfinished' event (and possibly instead of 'response' event), the 'requestfailed' event is emitted.
If request gets a 'redirect' response, the request is successfully finished with the 'requestfinished' event, and a new request is issued to a redirected url.
Request class represents requests which are sent by page. Request implements Body mixin, which in case of HTTP POST requests allows clients to call request.json()
or request.text()
to get different representations of request's body.
request.abort()
Aborts request. To use this, request interception should be enabled with page.setRequestInterceptionEnabled
.
Exception is immediately thrown if the request interception is not enabled.
request.continue([overrides])
overrides
<Object> Optional request overwrites, which could be one of the following:
Continues request with optional request overrides. To use this, request interception should be enabled with page.setRequestInterceptionEnabled
.
Exception is immediately thrown if the request interception is not enabled.
request.headers
- <Map> A map of HTTP headers associated with the request.
request.method
- <string>
Contains the request's method (GET, POST, etc.)
request.postData
- <string>
Contains the request's post body, if any.
request.response()
request.url
- <string>
Contains the URL of the request.
class: Response
Response class represents responses which are received by page. Response implements Body mixin, which allows clients to call response.json()
or response.text()
to get different representations of response body.
response.buffer()
- returns: <Promise<Buffer>> Promise which resolves to a buffer with response body.
response.headers
- <Map> A map of HTTP headers associated with the response.
response.json()
- returns: <Promise<Object>> Promise which resolves to a JSON representation of response body.
This method will throw if the response body is not parsable via JSON.parse
.
response.ok
- <boolean>
Contains a boolean stating whether the response was successful (status in the range 200-299) or not.
response.request()
response.status
- <number>
Contains the status code of the response (e.g., 200 for a success).
response.text()
- returns: <Promise<[text]>> Promise which resolves to a text representation of response body.
response.url
- <string>
Contains the URL of the response.