HTTP Downloads
XMLHttpRequest
and fetch
browser APIs enable binary data transfer between
web browser clients and web servers. Since this library works in web browsers,
server conversion work can be offloaded to the client! This demo shows a few
common scenarios involving browser APIs and popular wrapper libraries.
This demo focuses on downloading files. Other demos cover other HTTP use cases:
- "HTTP Uploads" covers uploading files
- "HTTP Server Processing" covers HTTP servers
Third-party cloud platforms such as AWS may corrupt raw binary downloads by encoding requests and responses in UTF-8 strings.
For AWS, in the "Binary Media Types" section of the API Gateway console, the
"application/vnd.ms-excel"
type should be added to ensure that AWS Lambda
functions functions can send files to clients.
Downloading Binary Data
Most interesting spreadsheet files are binary data that contain byte sequences that represent invalid UTF-8 characters.
APIs generally provide options to control how downloaded data is interpreted.
The arraybuffer
response type usually forces the data to be presented as an
ArrayBuffer
object which can be parsed with the SheetJS read
method1.
The following example shows the data flow using fetch
to download files:
/* download data into an ArrayBuffer object */
const res = await fetch("https://sheetjs.com/pres.numbers");
const ab = await res.arrayBuffer(); // recover data as ArrayBuffer
/* parse file */
const wb = XLSX.read(ab);
Browser Demos
When the page is accessed, the browser will attempt to download https://sheetjs.com/pres.numbers
and read the workbook. The old table will be replaced with a table whose
contents match the first worksheet. The table is generated using the SheetJS
sheet_to_html
method2
Each browser demo was tested in the following environments:
Browser | Date |
---|---|
Chrome 120 | 2024-01-15 |
Safari 17.2 | 2024-01-15 |
XMLHttpRequest
For downloading data, the arraybuffer
response type generates an ArrayBuffer
that can be viewed as an Uint8Array
and fed to the SheetJS read
method. For
legacy browsers, the option type: "array"
should be specified:
/* set up an async GET request */
var req = new XMLHttpRequest();
req.open("GET", url, true);
req.responseType = "arraybuffer";
req.onload = function(e) {
/* parse the data when it is received */
var data = new Uint8Array(req.response);
var workbook = XLSX.read(data, {type:"array"});
/* DO SOMETHING WITH workbook HERE */
};
req.send();
Live Download demo (click to show)
This demo uses XMLHttpRequest
to download https://sheetjs.com/pres.numbers
and show the data in an HTML table.
fetch
For downloading data, Response#arrayBuffer
resolves to an ArrayBuffer
that
can be converted to Uint8Array
and passed to the SheetJS read
method:
fetch(url).then(function(res) {
/* get the data as a Blob */
if(!res.ok) throw new Error("fetch failed");
return res.arrayBuffer();
}).then(function(ab) {
/* parse the data when it is received */
var data = new Uint8Array(ab);
var workbook = XLSX.read(data, {type:"array"});
/* DO SOMETHING WITH workbook HERE */
});
Live Download demo (click to show)
This demo uses fetch
to download https://sheetjs.com/pres.numbers and show
the data in an HTML table.
jQuery
jQuery is a JavaScript library that includes helpers for
performing "Ajax" network requests. jQuery.ajax
($.ajax
) does not support
binary data out of the box3. A custom ajaxTransport
can add support.
SheetJS users have reported success with jquery.binarytransport.js
4 in IE10.
After including the main jquery.js
and jquery.binarytransport.js
scripts,
$.ajax
will support dataType: "binary"
and processData: false
.
In a GET request, the default behavior is to return a Blob
object. Passing
responseType: "arraybuffer"
returns a proper ArrayBuffer
object in IE10:
$.ajax({
type: "GET", url: "https://sheetjs.com/pres.numbers",
/* suppress jQuery post-processing */
processData: false,
/* use the binary transport */
dataType: "binary",
/* pass an ArrayBuffer in the callback */
responseType: "arraybuffer",
success: function (ab) {
/* at this point, ab is an ArrayBuffer */
var wb = XLSX.read(ab);
/* do something with workbook here */
var ws = wb.Sheets[wb.SheetNames[0]];
var html = XLSX.utils.sheet_to_html(ws);
$("#out").html(html);
}
});
Wrapper Libraries
Before fetch
shipped with browsers, there were various wrapper libraries to
simplify XMLHttpRequest
. Due to limitations with fetch
, these libraries are
still relevant.
axios
axios
presents a Promise based interface. Setting
responseType
to arraybuffer
ensures the return type is an ArrayBuffer. The
data
property of the result can be passed to the SheetJS read
method:
async function workbook_dl_axios(url) {
const res = await axios(url, {responseType:'arraybuffer'});
const workbook = XLSX.read(res.data);
return workbook;
}
Live Download demo (click to show)
This demo uses axios
to download https://sheetjs.com/pres.numbers and show
the data in an HTML table.
If the live demo shows a message
ReferenceError: axios is not defined
please refresh the page. This is a known bug in the documentation generator.
superagent
superagent
is a network request library
with a "Fluent Interface". Calling the responseType
method with
"arraybuffer"
will ensure the final response object is an ArrayBuffer
:
/* set up an async GET request with superagent */
superagent
.get(url)
.responseType('arraybuffer')
.end(function(err, res) {
/* parse the data when it is received */
var data = new Uint8Array(res.body);
var workbook = XLSX.read(data, {type:"array"});
/* DO SOMETHING WITH workbook HERE */
});
Live Download demo (click to show)
This demo uses superagent
to download https://sheetjs.com/pres.numbers and
show the data in an HTML table.
If the live demo shows a message
ReferenceError: superagent is not defined
please refresh the page. This is a known bug in the documentation generator.
NodeJS Demos
These examples show how to download data in NodeJS.
HTTPS GET
The https
module provides a low-level get
method for HTTPS GET requests:
var https = require("https"), XLSX = require("xlsx");
https.get('https://sheetjs.com/pres.numbers', function(res) {
var bufs = [];
res.on('data', function(chunk) { bufs.push(chunk); });
res.on('end', function() {
var buf = Buffer.concat(bufs);
var wb = XLSX.read(buf);
/* print the first worksheet to console */
var ws = wb.Sheets[wb.SheetNames[0]];
console.log(XLSX.utils.sheet_to_csv(ws));
});
});
Complete Example (click to show)
This demo was last tested on 2024 January 15 against NodeJS 20.11.0
1) Install the NodeJS module
npm i --save https://cdn.sheetjs.com/xlsx-0.20.1/xlsx-0.20.1.tgz
2) Copy the SheetJSHTTPSGet.js
code snippet to a file SheetJSHTTPSGet.js
3) Run the script:
node SheetJSHTTPSGet.js
If successful, the script will print CSV contents of the test file.
fetch
Experimental support for fetch
was introduced in NodeJS 16.15.0
. It will be
considered stable in NodeJS LTS version 22
.
The fetch
implementation has the same return types as the browser version:
async function parse_from_url(url) {
const res = await fetch(url);
if(!res.ok) throw new Error("fetch failed");
const ab = await res.arrayBuffer();
const workbook = XLSX.read(ab);
return workbook;
}
Complete Example (click to show)
This demo was last tested on 2024 January 15 against NodeJS 20.11.0
1) Install the NodeJS module
npm i --save https://cdn.sheetjs.com/xlsx-0.20.1/xlsx-0.20.1.tgz
2) Save the following to SheetJSFetch.js
:
var XLSX = require("xlsx");
async function parse_from_url(url) {
const res = await fetch(url);
if(!res.ok) throw new Error("fetch failed");
const ab = await res.arrayBuffer();
const workbook = XLSX.read(ab);
return workbook;
}
(async() => {
const wb = await parse_from_url('https://sheetjs.com/pres.numbers');
/* print the first worksheet to console */
var ws = wb.Sheets[wb.SheetNames[0]];
console.log(XLSX.utils.sheet_to_csv(ws));
})();
3) Run the script:
node SheetJSFetch.js
If successful, the script will print CSV contents of the test file.
Wrapper Libraries
The latest releases of NodeJS support fetch
natively. Before fetch
support
was added to the platform, third party modules wrapped the native APIs.
request
request
has been deprecated and should only be used in legacy deployments.
Setting the option encoding: null
passes raw buffers:
var XLSX = require('xlsx'), request = require('request');
var url = 'https://sheetjs.com/pres.numbers';
/* call `request` with the option `encoding: null` */
request(url, {encoding: null}, function(err, res, data) {
if(err || res.statusCode !== 200) return;
/* if the request was successful, parse the data */
var wb = XLSX.read(data);
/* print the first worksheet to console */
var ws = wb.Sheets[wb.SheetNames[0]];
console.log(XLSX.utils.sheet_to_csv(ws));
});
Complete Example (click to show)
This demo was last tested on 2024 January 15 against request 2.88.2
1) Install the NodeJS module
npm i --save https://cdn.sheetjs.com/xlsx-0.20.1/xlsx-0.20.1.tgz [email protected]
2) Copy the SheetJSRequest.js
code snippet to a file SheetJSRequest.js
3) Run the script:
node SheetJSRequest.js
If successful, the script will print CSV contents of the test file.
axios
When the responseType
is "arraybuffer"
, axios
actually captures the data
in a NodeJS Buffer. The SheetJS read
method handles NodeJS Buffer objects:
const XLSX = require("xlsx"), axios = require("axios");
async function workbook_dl_axios(url) {
const res = await axios(url, {responseType:'arraybuffer'});
/* at this point, res.data is a Buffer */
const workbook = XLSX.read(res.data);
return workbook;
}
Complete Example (click to show)
This demo was last tested on 2024 January 15 against Axios 1.6.5
1) Install the NodeJS module
npm i --save https://cdn.sheetjs.com/xlsx-0.20.1/xlsx-0.20.1.tgz [email protected]
2) Save the following to SheetJSAxios.js
:
const XLSX = require("xlsx"), axios = require("axios");
async function workbook_dl_axios(url) {
const res = await axios(url, {responseType:'arraybuffer'});
/* at this point, res.data is a Buffer */
const workbook = XLSX.read(res.data);
return workbook;
}
(async() => {
const wb = await workbook_dl_axios('https://sheetjs.com/pres.numbers');
/* print the first worksheet to console */
var ws = wb.Sheets[wb.SheetNames[0]];
console.log(XLSX.utils.sheet_to_csv(ws));
})();
3) Run the script:
node SheetJSAxios.js
If successful, the script will print CSV contents of the test file.
Other Platforms
Other demos show network operations in special platforms:
- React Native "Fetching Remote Data"
- NativeScript "Fetching Remote Files"
- AngularJS "Remote Files"
- Dojo Toolkit "Parsing Remote Files"
- See
read
in "Reading Files"↩ - See
sheet_to_html
in "Utilities"↩ - See
dataType
injQuery.ajax
in the official jQuery documentation.↩ - See the official
jquery.binarytransport.js
repo for more details.↩