{"data":{"id":"10.48550/arxiv.1910.11555","type":"dois","attributes":{"doi":"10.48550/arxiv.1910.11555","prefix":"10.48550","suffix":"arxiv.1910.11555","identifiers":[{"identifier":"1910.11555","identifierType":"arXiv"}],"alternateIdentifiers":[{"alternateIdentifierType":"arXiv","alternateIdentifier":"1910.11555"}],"creators":[{"name":"Sun, Zhiqing","nameType":"Personal","givenName":"Zhiqing","familyName":"Sun","affiliation":[],"nameIdentifiers":[]},{"name":"Li, Zhuohan","nameType":"Personal","givenName":"Zhuohan","familyName":"Li","affiliation":[],"nameIdentifiers":[]},{"name":"Wang, Haoqing","nameType":"Personal","givenName":"Haoqing","familyName":"Wang","affiliation":[],"nameIdentifiers":[]},{"name":"Lin, Zi","nameType":"Personal","givenName":"Zi","familyName":"Lin","affiliation":[],"nameIdentifiers":[]},{"name":"He, Di","nameType":"Personal","givenName":"Di","familyName":"He","affiliation":[],"nameIdentifiers":[]},{"name":"Deng, Zhi-Hong","nameType":"Personal","givenName":"Zhi-Hong","familyName":"Deng","affiliation":[],"nameIdentifiers":[]}],"titles":[{"title":"Fast Structured Decoding for Sequence Models"}],"publisher":"arXiv","container":{},"publicationYear":2019,"subjects":[{"lang":"en","subject":"Machine Learning (cs.LG)","subjectScheme":"arXiv"},{"lang":"en","subject":"Computation and Language (cs.CL)","subjectScheme":"arXiv"},{"lang":"en","subject":"Machine Learning (stat.ML)","subjectScheme":"arXiv"},{"subject":"FOS: Computer and information sciences","subjectScheme":"Fields of Science and Technology (FOS)"},{"subject":"FOS: Computer and information sciences","schemeUri":"http://www.oecd.org/science/inno/38235147.pdf","subjectScheme":"Fields of Science and Technology (FOS)"}],"contributors":[],"dates":[{"date":"2019-10-25T07:32:52Z","dateType":"Submitted","dateInformation":"v1"},{"date":"2019-10-28T00:08:31Z","dateType":"Updated","dateInformation":"v1"},{"date":"2020-01-09T08:25:23Z","dateType":"Submitted","dateInformation":"v2"},{"date":"2020-01-10T01:06:52Z","dateType":"Updated","dateInformation":"v2"},{"date":"2019-10","dateType":"Available","dateInformation":"v1"},{"date":"2019","dateType":"Issued"}],"language":null,"types":{"ris":"GEN","bibtex":"misc","citeproc":"article","schemaOrg":"CreativeWork","resourceType":"Article","resourceTypeGeneral":"Preprint"},"relatedIdentifiers":[],"relatedItems":[],"sizes":[],"formats":[],"version":"2","rightsList":[{"rights":"arXiv.org perpetual, non-exclusive license","rightsUri":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/"}],"descriptions":[{"description":"Autoregressive sequence models achieve state-of-the-art performance in domains like machine translation. However, due to the autoregressive factorization nature, these models suffer from heavy latency during inference. Recently, non-autoregressive sequence models were proposed to reduce the inference time. However, these models assume that the decoding process of each token is conditionally independent of others. Such a generation process sometimes makes the output sentence inconsistent, and thus the learned non-autoregressive models could only achieve inferior accuracy compared to their autoregressive counterparts. To improve then decoding consistency and reduce the inference cost at the same time, we propose to incorporate a structured inference module into the non-autoregressive models. Specifically, we design an efficient approximation for Conditional Random Fields (CRF) for non-autoregressive sequence models, and further propose a dynamic transition technique to model positional contexts in the CRF. Experiments in machine translation show that while increasing little latency (8~14ms), our model could achieve significantly better translation performance than previous non-autoregressive models on different translation datasets. In particular, for the WMT14 En-De dataset, our model obtains a BLEU score of 26.80, which largely outperforms the previous non-autoregressive baselines and is only 0.61 lower in BLEU than purely autoregressive models.","descriptionType":"Abstract"},{"description":"Accepted to NeurIPS 2019 (Previous title: Structured Decoding for Non-Autoregressive Machine Translation)","descriptionType":"Other"}],"geoLocations":[],"fundingReferences":[],"xml":"PD94bWwgdmVyc2lvbj0iMS4wIiBlbmNvZGluZz0idXRmLTgiPz4KPHJlc291cmNlIHhtbG5zPSJodHRwOi8vZGF0YWNpdGUub3JnL3NjaGVtYS9rZXJuZWwtNCIgeG1sbnM6eHNpPSJodHRwOi8vd3d3LnczLm9yZy8yMDAxL1hNTFNjaGVtYS1pbnN0YW5jZSIgeHNpOnNjaGVtYUxvY2F0aW9uPSJodHRwOi8vZGF0YWNpdGUub3JnL3NjaGVtYS9rZXJuZWwtNCBodHRwOi8vc2NoZW1hLmRhdGFjaXRlLm9yZy9tZXRhL2tlcm5lbC00LjMvbWV0YWRhdGEueHNkIj4KICA8aWRlbnRpZmllciBpZGVudGlmaWVyVHlwZT0iRE9JIj4xMC40ODU1MC9BUlhJVi4xOTEwLjExNTU1PC9pZGVudGlmaWVyPgogIDxhbHRlcm5hdGVJZGVudGlmaWVycz4KICAgIDxhbHRlcm5hdGVJZGVudGlmaWVyIGFsdGVybmF0ZUlkZW50aWZpZXJUeXBlPSJhclhpdiI+MTkxMC4xMTU1NTwvYWx0ZXJuYXRlSWRlbnRpZmllcj4KICA8L2FsdGVybmF0ZUlkZW50aWZpZXJzPgogIDxjcmVhdG9ycz4KICAgIDxjcmVhdG9yPgogICAgICA8Y3JlYXRvck5hbWUgbmFtZVR5cGU9IlBlcnNvbmFsIj5TdW4sIFpoaXFpbmc8L2NyZWF0b3JOYW1lPgogICAgICA8Z2l2ZW5OYW1lPlpoaXFpbmc8L2dpdmVuTmFtZT4KICAgICAgPGZhbWlseU5hbWU+U3VuPC9mYW1pbHlOYW1lPgogICAgPC9jcmVhdG9yPgogICAgPGNyZWF0b3I+CiAgICAgIDxjcmVhdG9yTmFtZSBuYW1lVHlwZT0iUGVyc29uYWwiPkxpLCBaaHVvaGFuPC9jcmVhdG9yTmFtZT4KICAgICAgPGdpdmVuTmFtZT5aaHVvaGFuPC9naXZlbk5hbWU+CiAgICAgIDxmYW1pbHlOYW1lPkxpPC9mYW1pbHlOYW1lPgogICAgPC9jcmVhdG9yPgogICAgPGNyZWF0b3I+CiAgICAgIDxjcmVhdG9yTmFtZSBuYW1lVHlwZT0iUGVyc29uYWwiPldhbmcsIEhhb3Fpbmc8L2NyZWF0b3JOYW1lPgogICAgICA8Z2l2ZW5OYW1lPkhhb3Fpbmc8L2dpdmVuTmFtZT4KICAgICAgPGZhbWlseU5hbWU+V2FuZzwvZmFtaWx5TmFtZT4KICAgIDwvY3JlYXRvcj4KICAgIDxjcmVhdG9yPgogICAgICA8Y3JlYXRvck5hbWUgbmFtZVR5cGU9IlBlcnNvbmFsIj5MaW4sIFppPC9jcmVhdG9yTmFtZT4KICAgICAgPGdpdmVuTmFtZT5aaTwvZ2l2ZW5OYW1lPgogICAgICA8ZmFtaWx5TmFtZT5MaW48L2ZhbWlseU5hbWU+CiAgICA8L2NyZWF0b3I+CiAgICA8Y3JlYXRvcj4KICAgICAgPGNyZWF0b3JOYW1lIG5hbWVUeXBlPSJQZXJzb25hbCI+SGUsIERpPC9jcmVhdG9yTmFtZT4KICAgICAgPGdpdmVuTmFtZT5EaTwvZ2l2ZW5OYW1lPgogICAgICA8ZmFtaWx5TmFtZT5IZTwvZmFtaWx5TmFtZT4KICAgIDwvY3JlYXRvcj4KICAgIDxjcmVhdG9yPgogICAgICA8Y3JlYXRvck5hbWUgbmFtZVR5cGU9IlBlcnNvbmFsIj5EZW5nLCBaaGktSG9uZzwvY3JlYXRvck5hbWU+CiAgICAgIDxnaXZlbk5hbWU+WmhpLUhvbmc8L2dpdmVuTmFtZT4KICAgICAgPGZhbWlseU5hbWU+RGVuZzwvZmFtaWx5TmFtZT4KICAgIDwvY3JlYXRvcj4KICA8L2NyZWF0b3JzPgogIDx0aXRsZXM+CiAgICA8dGl0bGU+RmFzdCBTdHJ1Y3R1cmVkIERlY29kaW5nIGZvciBTZXF1ZW5jZSBNb2RlbHM8L3RpdGxlPgogIDwvdGl0bGVzPgogIDxwdWJsaXNoZXI+YXJYaXY8L3B1Ymxpc2hlcj4KICA8cHVibGljYXRpb25ZZWFyPjIwMTk8L3B1YmxpY2F0aW9uWWVhcj4KICA8c3ViamVjdHM+CiAgICA8c3ViamVjdCB4bWw6bGFuZz0iZW4iIHN1YmplY3RTY2hlbWU9ImFyWGl2Ij5NYWNoaW5lIExlYXJuaW5nIChjcy5MRyk8L3N1YmplY3Q+CiAgICA8c3ViamVjdCB4bWw6bGFuZz0iZW4iIHN1YmplY3RTY2hlbWU9ImFyWGl2Ij5Db21wdXRhdGlvbiBhbmQgTGFuZ3VhZ2UgKGNzLkNMKTwvc3ViamVjdD4KICAgIDxzdWJqZWN0IHhtbDpsYW5nPSJlbiIgc3ViamVjdFNjaGVtZT0iYXJYaXYiPk1hY2hpbmUgTGVhcm5pbmcgKHN0YXQuTUwpPC9zdWJqZWN0PgogICAgPHN1YmplY3Qgc3ViamVjdFNjaGVtZT0iRmllbGRzIG9mIFNjaWVuY2UgYW5kIFRlY2hub2xvZ3kgKEZPUykiPkZPUzogQ29tcHV0ZXIgYW5kIGluZm9ybWF0aW9uIHNjaWVuY2VzPC9zdWJqZWN0PgogIDwvc3ViamVjdHM+CiAgPGRhdGVzPgogICAgPGRhdGUgZGF0ZVR5cGU9IlN1Ym1pdHRlZCIgZGF0ZUluZm9ybWF0aW9uPSJ2MSI+MjAxOS0xMC0yNVQwNzozMjo1Mlo8L2RhdGU+CiAgICA8ZGF0ZSBkYXRlVHlwZT0iVXBkYXRlZCIgZGF0ZUluZm9ybWF0aW9uPSJ2MSI+MjAxOS0xMC0yOFQwMDowODozMVo8L2RhdGU+CiAgICA8ZGF0ZSBkYXRlVHlwZT0iU3VibWl0dGVkIiBkYXRlSW5mb3JtYXRpb249InYyIj4yMDIwLTAxLTA5VDA4OjI1OjIzWjwvZGF0ZT4KICAgIDxkYXRlIGRhdGVUeXBlPSJVcGRhdGVkIiBkYXRlSW5mb3JtYXRpb249InYyIj4yMDIwLTAxLTEwVDAxOjA2OjUyWjwvZGF0ZT4KICAgIDxkYXRlIGRhdGVUeXBlPSJBdmFpbGFibGUiIGRhdGVJbmZvcm1hdGlvbj0idjEiPjIwMTktMTA8L2RhdGU+CiAgPC9kYXRlcz4KICA8cmVzb3VyY2VUeXBlIHJlc291cmNlVHlwZUdlbmVyYWw9IlByZXByaW50Ij5BcnRpY2xlPC9yZXNvdXJjZVR5cGU+CiAgPHZlcnNpb24+MjwvdmVyc2lvbj4KICA8cmlnaHRzTGlzdD4KICAgIDxyaWdodHMgcmlnaHRzVVJJPSJodHRwOi8vYXJ4aXYub3JnL2xpY2Vuc2VzL25vbmV4Y2x1c2l2ZS1kaXN0cmliLzEuMC8iPmFyWGl2Lm9yZyBwZXJwZXR1YWwsIG5vbi1leGNsdXNpdmUgbGljZW5zZTwvcmlnaHRzPgogIDwvcmlnaHRzTGlzdD4KICA8ZGVzY3JpcHRpb25zPgogICAgPGRlc2NyaXB0aW9uIGRlc2NyaXB0aW9uVHlwZT0iQWJzdHJhY3QiPkF1dG9yZWdyZXNzaXZlIHNlcXVlbmNlIG1vZGVscyBhY2hpZXZlIHN0YXRlLW9mLXRoZS1hcnQgcGVyZm9ybWFuY2UgaW4gZG9tYWlucyBsaWtlIG1hY2hpbmUgdHJhbnNsYXRpb24uIEhvd2V2ZXIsIGR1ZSB0byB0aGUgYXV0b3JlZ3Jlc3NpdmUgZmFjdG9yaXphdGlvbiBuYXR1cmUsIHRoZXNlIG1vZGVscyBzdWZmZXIgZnJvbSBoZWF2eSBsYXRlbmN5IGR1cmluZyBpbmZlcmVuY2UuIFJlY2VudGx5LCBub24tYXV0b3JlZ3Jlc3NpdmUgc2VxdWVuY2UgbW9kZWxzIHdlcmUgcHJvcG9zZWQgdG8gcmVkdWNlIHRoZSBpbmZlcmVuY2UgdGltZS4gSG93ZXZlciwgdGhlc2UgbW9kZWxzIGFzc3VtZSB0aGF0IHRoZSBkZWNvZGluZyBwcm9jZXNzIG9mIGVhY2ggdG9rZW4gaXMgY29uZGl0aW9uYWxseSBpbmRlcGVuZGVudCBvZiBvdGhlcnMuIFN1Y2ggYSBnZW5lcmF0aW9uIHByb2Nlc3Mgc29tZXRpbWVzIG1ha2VzIHRoZSBvdXRwdXQgc2VudGVuY2UgaW5jb25zaXN0ZW50LCBhbmQgdGh1cyB0aGUgbGVhcm5lZCBub24tYXV0b3JlZ3Jlc3NpdmUgbW9kZWxzIGNvdWxkIG9ubHkgYWNoaWV2ZSBpbmZlcmlvciBhY2N1cmFjeSBjb21wYXJlZCB0byB0aGVpciBhdXRvcmVncmVzc2l2ZSBjb3VudGVycGFydHMuIFRvIGltcHJvdmUgdGhlbiBkZWNvZGluZyBjb25zaXN0ZW5jeSBhbmQgcmVkdWNlIHRoZSBpbmZlcmVuY2UgY29zdCBhdCB0aGUgc2FtZSB0aW1lLCB3ZSBwcm9wb3NlIHRvIGluY29ycG9yYXRlIGEgc3RydWN0dXJlZCBpbmZlcmVuY2UgbW9kdWxlIGludG8gdGhlIG5vbi1hdXRvcmVncmVzc2l2ZSBtb2RlbHMuIFNwZWNpZmljYWxseSwgd2UgZGVzaWduIGFuIGVmZmljaWVudCBhcHByb3hpbWF0aW9uIGZvciBDb25kaXRpb25hbCBSYW5kb20gRmllbGRzIChDUkYpIGZvciBub24tYXV0b3JlZ3Jlc3NpdmUgc2VxdWVuY2UgbW9kZWxzLCBhbmQgZnVydGhlciBwcm9wb3NlIGEgZHluYW1pYyB0cmFuc2l0aW9uIHRlY2huaXF1ZSB0byBtb2RlbCBwb3NpdGlvbmFsIGNvbnRleHRzIGluIHRoZSBDUkYuIEV4cGVyaW1lbnRzIGluIG1hY2hpbmUgdHJhbnNsYXRpb24gc2hvdyB0aGF0IHdoaWxlIGluY3JlYXNpbmcgbGl0dGxlIGxhdGVuY3kgKDh+MTRtcyksIG91ciBtb2RlbCBjb3VsZCBhY2hpZXZlIHNpZ25pZmljYW50bHkgYmV0dGVyIHRyYW5zbGF0aW9uIHBlcmZvcm1hbmNlIHRoYW4gcHJldmlvdXMgbm9uLWF1dG9yZWdyZXNzaXZlIG1vZGVscyBvbiBkaWZmZXJlbnQgdHJhbnNsYXRpb24gZGF0YXNldHMuIEluIHBhcnRpY3VsYXIsIGZvciB0aGUgV01UMTQgRW4tRGUgZGF0YXNldCwgb3VyIG1vZGVsIG9idGFpbnMgYSBCTEVVIHNjb3JlIG9mIDI2LjgwLCB3aGljaCBsYXJnZWx5IG91dHBlcmZvcm1zIHRoZSBwcmV2aW91cyBub24tYXV0b3JlZ3Jlc3NpdmUgYmFzZWxpbmVzIGFuZCBpcyBvbmx5IDAuNjEgbG93ZXIgaW4gQkxFVSB0aGFuIHB1cmVseSBhdXRvcmVncmVzc2l2ZSBtb2RlbHMuPC9kZXNjcmlwdGlvbj4KICAgIDxkZXNjcmlwdGlvbiBkZXNjcmlwdGlvblR5cGU9Ik90aGVyIj5BY2NlcHRlZCB0byBOZXVySVBTIDIwMTkgKFByZXZpb3VzIHRpdGxlOiBTdHJ1Y3R1cmVkIERlY29kaW5nIGZvciBOb24tQXV0b3JlZ3Jlc3NpdmUgTWFjaGluZSBUcmFuc2xhdGlvbik8L2Rlc2NyaXB0aW9uPgogIDwvZGVzY3JpcHRpb25zPgo8L3Jlc291cmNlPg==","url":"https://arxiv.org/abs/1910.11555","contentUrl":null,"metadataVersion":0,"schemaVersion":"http://datacite.org/schema/kernel-4","source":"mds","isActive":true,"state":"findable","reason":null,"viewCount":0,"viewsOverTime":[],"downloadCount":0,"downloadsOverTime":[],"referenceCount":0,"citationCount":0,"citationsOverTime":[],"partCount":0,"partOfCount":0,"versionCount":0,"versionOfCount":0,"created":"2022-02-27T08:03:03.000Z","registered":"2022-02-27T08:03:04.000Z","published":"2019","updated":"2022-02-27T08:03:04.000Z"},"relationships":{"client":{"data":{"id":"arxiv.content","type":"clients"}},"provider":{"data":{"id":"arxiv","type":"providers"}},"media":{"data":{"id":"10.48550/arxiv.1910.11555","type":"media"}},"references":{"data":[]},"citations":{"data":[]},"parts":{"data":[]},"partOf":{"data":[]},"versions":{"data":[]},"versionOf":{"data":[]}}}}