Paper-Writing-Tips

MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips

4,043

498

4,043

View on GitHub

Top Related Projects

TextBlob

9,410

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

spaCy

31,840

💫 Industrial-strength Natural Language Processing (NLP) in Python

transformers

146,142

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

stanza

7,500

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Quick Overview

The MLNLP-World/Paper-Writing-Tips repository is a comprehensive collection of resources and guidelines for writing academic papers in the fields of Machine Learning (ML) and Natural Language Processing (NLP). It aims to help researchers and students improve their paper writing skills by providing tips, templates, and best practices.

Pros

Offers a wide range of tips covering various aspects of paper writing, from structure to language
Includes templates and examples for different sections of academic papers
Regularly updated with contributions from the community
Provides guidance specific to ML and NLP fields

Cons

May not cover all specific requirements for every conference or journal
Some tips might be subjective or not universally applicable
Lacks interactive elements or tools for direct implementation of the tips
Could benefit from more extensive examples of well-written papers

As this is not a code library, we'll skip the code examples and getting started instructions sections.

Competitor Comparisons

TextBlob

9,410

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Pros of TextBlob

Provides a simple API for common natural language processing (NLP) tasks
Includes built-in models for sentiment analysis and part-of-speech tagging
Offers easy-to-use text processing functions like noun phrase extraction and word inflection

Cons of TextBlob

Limited to basic NLP tasks, not suitable for advanced research or complex language models
May not be as up-to-date with the latest NLP techniques compared to more specialized libraries
Lacks specific features for academic paper writing or formatting

Code Comparison

TextBlob:

from textblob import TextBlob
text = "TextBlob is simple to use."
blob = TextBlob(text)
print(blob.sentiment)

Paper-Writing-Tips:

# Title of Your Paper

## Abstract

Your abstract goes here.

## Introduction

Start your introduction...

While TextBlob focuses on providing code for NLP tasks, Paper-Writing-Tips offers markdown templates and guidelines for academic paper structure. The repositories serve different purposes, with TextBlob being a practical NLP tool and Paper-Writing-Tips being a resource for improving academic writing skills.

spaCy

31,840

💫 Industrial-strength Natural Language Processing (NLP) in Python

Pros of spaCy

Comprehensive NLP library with production-ready capabilities
Extensive documentation and community support
Optimized for performance and efficiency in processing large volumes of text

Cons of spaCy

Steeper learning curve for beginners compared to Paper-Writing-Tips
Focused on NLP tasks rather than academic writing guidance
Requires more computational resources and setup

Code Comparison

Paper-Writing-Tips (no code examples available)

spaCy:

import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("This is a sample sentence.")
for token in doc:
    print(token.text, token.pos_, token.dep_)

Summary

While Paper-Writing-Tips is a collection of guidelines for academic writing, spaCy is a full-fledged NLP library. Paper-Writing-Tips offers valuable advice for researchers and students, whereas spaCy provides tools for text processing and analysis. The choice between them depends on whether you need writing guidance or NLP capabilities.

transformers

146,142

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Pros of transformers

Comprehensive library for state-of-the-art NLP models
Extensive documentation and community support
Regularly updated with new models and features

Cons of transformers

Steeper learning curve for beginners
Larger codebase and dependencies
Focused on model implementation rather than research writing

Code comparison

transformers:

from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

Paper-Writing-Tips:

# Tips for Writing ML/NLP Papers

1. Start with a clear outline
2. Use concise and precise language
3. Include relevant visualizations

The transformers repository provides a powerful toolkit for working with NLP models, while Paper-Writing-Tips offers guidance on academic writing in the ML/NLP field. transformers is more code-focused, providing implementations of various models, while Paper-Writing-Tips is a collection of markdown files with writing advice. The code examples reflect this difference, with transformers showing model usage and Paper-Writing-Tips presenting markdown-formatted tips.

stanza

7,500

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Pros of Stanza

Comprehensive NLP toolkit with support for multiple languages
Well-documented API and extensive examples for easy integration
Actively maintained with regular updates and improvements

Cons of Stanza

Focused on NLP tasks, not specifically tailored for academic paper writing
Steeper learning curve for non-technical users
Requires more computational resources due to its comprehensive nature

Code Comparison

Paper-Writing-Tips is primarily a collection of markdown files with writing advice, so there's no relevant code to compare. However, here's a sample of how to use Stanza for basic NLP tasks:

import stanza

nlp = stanza.Pipeline('en')
doc = nlp("Hello world!")
for sentence in doc.sentences:
    print([word.text for word in sentence.words])

Summary

Stanza is a powerful NLP toolkit suitable for various language processing tasks, while Paper-Writing-Tips is a curated collection of advice for academic writing. Stanza offers more technical capabilities but requires programming knowledge, whereas Paper-Writing-Tips provides accessible guidance for improving writing skills without any coding requirements.

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

README

Paper Writing Tips

é¡¹ç®å¨æº

æ¬é¡¹ç®çç¹è²ï¼

ååå¿çï¼åå«ä¸äºå¸¸è§çéè¯¯ï¼æ¯ä¸ªéè¯¯åéæä¾åï¼å¯ä»¥å¨å¨æåè®ºæä¹åå¿«éæµè§ã

**ç»ç¨¿å¿æ¥**ï¼åå«ä¸äºä¾åï¼æ¹ä¾¿å¿«éå®ä½æ¯å¦èªå·±çè®ºææéè¯¯ã

ç¾å®¶ä¹è¨ï¼æ´çäºä¸äºç½ç»ä¸å¬å¼çåä½èµæºï¼å¹¶ä¸å®å¨ï¼æ¬¢è¿è¡¥åï¼ï¼æ¹ä¾¿å¤§å®¶ç³»ç»å¦ä¹ ã

åè´£å£°æ

æ¬¢è¿è´¡ç®

è§£é

ååå¿ç

å¬å¼ç¬¦å·

1. æ éç¬¦å·ç¨å°åæä¸åæ¯è¡¨ç¤º

è¦ç¹: ä¸ºé¿åæ··æ·åæ¯ l åæ°å 1 ï¼åæ¯ l å¯ç¨ \ell æ¿ä»£ã

pics_1

2. æç»æçå¼ä½¿ç¨ \boldsymbolï¼Attentionï¼

è¦ç¹: æç»æçå¼ä¾å¦å¥ååºåãæ ãå¾ç ï¼ä¸å¾ä»å±ç¤ºä¸ºå¥ååºåæåµï¼

pics_2

3. \boldsymbol çéåå¯ç¨ \mathcal ï¼Attentionï¼

pics_3

4. åéå¼å°åå ç²ï¼ç©éµå¤§åå ç²

è¦ç¹: æä¸åæ¯ç¨\mathbfï¼å¸èåæ¯ç¨\boldsymbolã

pics_4

5. æ°åãææçä½¿ç¨\mathbb

pics_5

6. ä¿æåç´ ä¸éåçç¬¦å·å¯¹åº

pics_6

7. åä½é£æ ¼è¦æ£å¼ï¼é¿åç¼©å

don't æå¼åæ do nots
æææ ¼ 's å°½éè½¬åä¸º of

pics_7

8. æä¸ææ¯ç¨è¯

e.g., è¡¨ç¤º for example,
i.e., è¡¨ç¤º that is,
et al. è¡¨ç¤º and others of the same kind,
etc. è¡¨ç¤º and others,ï¼ä¸ç¨äºåä¸¾äºº
- et al. æ etc. å¨å¥æ«æ¶ï¼ä¸ç¨åæ·»å é¢å¤çå¥å·

9. è±æå¼å·

10. ä¸é´æç©ºæ ¼ "~"

Figure~\ref{} shows the model performance.
Table~\ref{} shows dataset details.
We use BERT~\cite{bert} model.
Section~\ref{} concludes this paper.

11. URL é¾æ¥

ä½¿ç¨ \url{} å½ä»¤ï¼éè¦å¯¼å¥åï¼

 \usepackage{hyperref}

12. å¼å·åªè¡¨ç¤ºæè°ï¼ä¸è¡¨ç¤ºå¼ç¨ï¼Attentionï¼

å¼ç¨çè¡¨è¿°èèä½¿ç¨æä½ \textit{} èä¸æ¯å¼å·ã

13. éåä¸ªåæ¯çåéå

å¬å¼ä¸ç softmaxï¼projï¼enc çè¶è¿ä¸ä¸ªåæ¯çåéæç¬¦å·ï¼ä½¿ç¨æ£æåä½ï¼å³ä½¿ç¨ \textrm æ \textit å½ä»¤ã

14. ä½¿ç¨å½æ°å½ä»¤

è®¸å¤å½æ°åç¬¦å·æç°æçå½ä»¤ï¼ä¾å¦ï¼\arg{}ï¼\max{}ï¼\sin{}ï¼\tanh{}ï¼\infï¼ \det{}ï¼ \exp{}.

15. å¬å¼ä¸çæ¬å·ï¼åºéè¿\leftï¼\rightè¿è¡æ è®°

å¦ \left(\right), \left{\right}, \left<\right>, \left|\right|çã
æ¬å·ä¸çåå²éè¿\middleå®ç°ã

Latexä»£ç å¦ä¸ï¼

\begin{gather}
 \bold{s} = \left(\sum_{i=0}^{N-1}{\alpha_{i} \bold{h}_i}\right) + \bold{h}_N\\
 \bold{s} = (\sum_{i=0}^{N-1}{\alpha_{i} \bold{h}_i}) + \bold{h}_N \\
\end{gather}

\begin{gather}
 \left\{ x \middle| x\ne\frac{1}{2}\right\} \\ 
 \{ x | x\ne\frac{1}{2}\}
\end{gather}

16. ä½¿ç¨ align è¡¨ç¤ºä¸ç»å¬å¼ï¼çå·å¯¹é½

ä½¿ç¨ align è¡¨ç¤ºä¸ç»å¬å¼ï¼çå·å¯¹é½ã

Latexä»£ç å¦ä¸ï¼

\begin{gather}
 E = m c^2 \\
 C = B \log_2\left(1+\frac{S}{N}\right)
\end{gather}

\begin{align}
 E &= m c^2 \\
 C &= B \log_2\left(1+\frac{S}{N}\right)
\end{align}

17. åªå¯¹referçå¬å¼ä¸å ç¼å·ï¼Attentionï¼

æ¨èï¼åªå¯¹referçå¬å¼å ç¼å·ï¼\nonumberå»ç¼å·ã

Latexä»£ç å¦ä¸ï¼

\begin{equation}
 E = m c^2 
\end{equation}

\begin{equation}
 E = m c^2 \nonumber
\end{equation}

è¡¨æ ¼å¾ç

18. ä½¿ç¨Booktabsç»å¶æ´å¥½ççè¡¨æ ¼

ç»å¶è¡¨æ ¼æ¶ï¼ä½¿ç¨ \usepackage{booktabs}ï¼ä»èåå© \toprule, \bottomrule, \midrule, \cmidrule å½ä»¤ï¼ç»åºå¥½ççåéçº¿ã

Latexä»£ç å¦ä¸ï¼

% Example of a table with booktabs from https://nhigham.com/2019/11/19/better-latex-tables-with-booktabs/.
% First version of table.
\begin{table}[htbp]
 \centering
 \begin{tabular}{|l|c|c|c|c|c|l|}
    \hline
    & \multicolumn{3}{c|}{E} & \multicolumn{3}{c|}{F}\\
    \hline
                & $mv$  & Rel.~err & Time    & $mv$  & Rel.~err & Time   \\\hline
    A    & 11034 & 1.3e-7 & 3.9 & 15846 & 2.7e-11 & 5.6 \\
    B & 21952 & 1.3e-7 & 6.2 & 31516 & 2.7e-11 & 8.8 \\
    C & 15883 & 5.2e-8 & 7.1 & 32023 & 1.1e-11 & 1.4 \\
    D  & 11180 & 8.0e-9 & 4.3 & 17348 & 1.5e-11 & 6.6 \\
    \hline
 \end{tabular}
 \caption{Without booktabs.}
 \label{tab:without-booktabs}
\end{table}

% Second version of table, with booktabs.
\begin{table}[htbp]
 \centering
 \begin{tabular}{lcccccl}\toprule
    & \multicolumn{3}{c}{E} & \multicolumn{3}{c}{F}
    \\\cmidrule(lr){2-4}\cmidrule(lr){5-7}
             & $mv$  & Rel.~err & Time    & $mv$  & Rel.~err & Time\\\midrule
    A    & 11034 & 1.3e-7 & 3.9 & 15846 & 2.7e-11 & 5.6 \\
    B & 21952 & 1.3e-7 & 6.2 & 31516 & 2.7e-11 & 8.8 \\
    C & 15883 & 5.2e-8 & 7.1 & 32023 & 1.1e-11 & 1.4\\
    D  & 11180 & 8.0e-9 & 4.3 & 17348 & 1.5e-11 & 6.6 
    \\\bottomrule
 \end{tabular}
 \caption{With booktabs.}
 \label{tab:with-booktabs}
\end{table}

19. ç« èãè¡¨æ ¼ãå¾ççå¼ç¨

ç« èãè¡¨æ ¼ãå¾çä½¿ç¨\label{...}å®ä¹åï¼éè¿\ref{...}èªå¨å¼ç¨è·³è½¬ã
å¯¹åå¾æåè¡¨çå¼ç¨å¯ä»¥ä½¿ç¨Figure~\ref{fig:figure}(a)æ¥è¡¨ç¤ºã

20. ä¸è¦æå¾è¡¨ä¸çCaptionå¨æ£æä¸å¤è¿°

è¯´æï¼Captionï¼æ¯ç¨æ¥åâè¿ä¸ªè¡¨æ ¼æ¯ä»ä¹âçã
æ£ææ¯ç¨æ¥åâè¿ä¸ªè¡¨æ ¼è¯´æäºä»ä¹âçã

22. è¡¨æ ¼å¤§å°è°æ´

ç¨ \centering å±ä¸ï¼ç¨\smallï¼\scriptsizeï¼\footnotesizeï¼\tiny è°æ´åå·
ç¨\setlength{\tabcolsep}{8pt} è°æ´åé´è·
ç¨ p{2cm} åºå®åå®½
ç¨\multirowï¼\multicolumn åå¹¶ååæ ¼

23. ç¢éå¾ï¼å¾ååºä½¿ç¨ç¢éå¾ï¼å¦PDFæ ¼å¼ï¼

ä½¿ç¨Adobe illustratorãOmniGraffleçè½¯ä»¶ç»å¶ååä¸ºç¢éå¾
ä½¿ç¨Matplotlibç»å¶ååå¨: plt.savefig('draw.pdf')
å¨LaTeXä¸ä½¿ç¨pgfplotsç´æ¥ç»å¶

24. å¾çåä½å¤§å°ä»äºæ£æåä½ä¸captionä¹é´

å»ºè®®å¾ä¸åä½å¤§å°ä¿æä¸è´

25. è®ºæä¸å¾çä¸æåè¯´æåå·åºåæ£ææåå¤§å°ç¸å½

å¾çä¸æååå·å¤§å°ä¸å®å¤ªå¤§

26. å¾è¡¨è®¾è®¡åºéç¨äºé»ç½æå°

å¯¹é»ç½æå°åå¥½ï¼ä¸è¦ä»¥é¢è²ä½ä¸ºæä»£å¾ç¤ºä¸çº¿æ¡çå¯ä¸ç¹å¾ï¼å¯ä½¿ç¨å®çº¿/èçº¿ ï¼äº®/æï¼ä¸åçº¿å½¢çã

27. å¾çé£æ ¼ä¿æç®æ´ç¾è§

ä¸è¦ä½¿ç¨è¿å¤çé¢è²ç§ç±»ï¼é¿åè¿äº®çé¢è²
ä½¿ç¨ç®æ´çå¾ç¤ºï¼å°½éå°ç¨æåæè¿°ï¼ä¾åé¤å¤ï¼
åæ ·åè½æ¨¡åä½¿ç¨ç»ä¸æ ¼å¼
ç®å¤´èµ°ååºè¶äºåä¸ä¸ªæ¹å

éè¯ç¨è¯

28. æ³¨æè¿è¯ç¬¦çè¯æ§

ä¸è¬è¿è¯ç¬¦ä¸ï¼æåä¸ä¸ªè¯æ¯åè¯çï¼è¿èµ·æ¥æ¯å½¢å®¹è¯è¯æ§ï¼

pic_29

æåä¸ä¸ªè¯æ¯å¨è¯çï¼è¿èµ·æ¥æ¯å¨è¯è¯æ§ã

pic_29

29. è¯æ§æéç¹

First, Secondlyï¼åä¸ºå¯è¯
trainingï¼ testï¼validationï¼åä¸ºåè¯

pic_30

30. ç¼©åç¬¦åä½¿ç¨ä¹ æ¯

ç¬¦åä¹ æ¯ï¼ä¸æåºèå°½éä¸è´CNNï¼LSTMï¼FEVERï¼ConceptNetï¼SQuADï¼BiDAFï¼FEVER scoreï¼Wikipediaã
åæ¬¡åºç°æ¶ï¼å¨ç§°å¨åï¼ç¼©åå¨åï¼æç¼©åå¨åï¼ç¨äºæ³¨éçcitationå¨åãgraph attention network (GAT)ï¼pre-trained language model (PLM)ï¼BERT~\citep{BERT}ã
é¢ååãä»»å¡åãææ çä¸è¬ä¸éè¦å¤§åï¼å¦ natural language processing, question answering, accuracy, macro-F1 score.

pic_31

31. æ³¨æåå¤æ°

å°¤å¶æ¯ä¸è§ååå¤æ°ååãä¸å¯æ°åè¯ã

pic_32

32. a/an è·çåé³é³ç´ èµ°

pic_33

33. theçä½¿ç¨

æ³¨æï¼ä¸è¬ä¸ä¼ç¬ç«åºç°ï¼ä¸ç¨å è¯ï¼å¯æ°åè¯åæ°ï¼è¦ä¹å theç¹æï¼è¦ä¹å å¤æ°æ³æã

pic_34

34. æ¶æï¼ä»¥ä¸è¬ç°å¨æ¶ä¸ºä¸»ï¼Attentionï¼

pic_35

35. é¿åç»å¯¹åè¡¨è¿°ã

ä½¿ç¨straightforwardæ¿æ¢obvious
ä½¿ç¨generallyãusuallyãoftenæ¿æ¢always
ä½¿ç¨rareæ¿æ¢never
ä½¿ç¨alleviateãrelieveæ¿æ¢avoidãeliminate

36. é¿åä¸äºæ¨¡ç³çè¡¨è¿°ï¼æ¯å¦ï¼meaning, semantic, betterçã

å¥åè¡¨è¿°

38. é¿åè¿å¤è´´æ ç¾ï¼æ¯å¦å¨è°è®ºææå¥½æ¶ã

æåºçæ¹æ³å°åºæ¹åäºåªéï¼æ¯ä»ä¹å¯¼è´çè¿ä¸ªç»æï¼

40. è§å¯/åç°ï¼åè®¾ï¼æ¹æ³ï¼ææï¼ä¸è¦æ··çè¯´ã

æ®µè½å¸å±

å¯éï¼å¯ä»¥å°è¯å¨è¯¥æ®µè¯çæåï¼æ·»å \looseness=-1ï¼ææ¶å¯ä»¥å¨ä¸å é¤æåä¸è¡çæåµä¸ï¼å°æåä¸è¡çä¸ªå«åè¯âæ¤ä¸å»âã

pic_42

åèæç®

42. åèæç®å¼ç¨éè¦ææ¥æ¯å¦å¨å¥åä¸åæå

è¦ç¹ï¼å¼ç¨ä½¿ç¨\citep{}ï¼ä½ä¸ºæå¥è¯ï¼æ\citet{}ï¼ä½ä¸ºå¥åä¸»è¦æåå¦ä¸»è¯ãå®¾è¯çã

pic_43

43. å°½éå¼ç¨åè¡¨ççæ¬èéarXivçæ¬ã

ä¼æ¾å¾æ£è§ä¸äº

pic_44

44. å¼ç¨æ¡ç®çæ ¼å¼å°½éååä¸è´

pic_45

å³äºç§æè±è¯ä¹¦åä¹ æ¯

ä¾å¦ grammarly, writefull.

4. æ¨¡åååå¤§å°åä¿æä¸è´ï¼å¦BERTï¼ELECTRAï¼é¿åBertï¼Electraï¼electraæ··åä½¿ç¨ã

5. ä¾å¥ãä¾åèèç¨æä½

8. Aåançåºå«å¨äºåé³ï¼an LSTM cell, an F/H/L/M/N/S/X, a U.

10. ä½¿ç¨babelå®ç°åè¯æé³æ é³èæ¢è¡ï¼hyphenation patternsï¼çææï¼å³`\usepackage[english]{babel}`

å³äºå¾çï¼

11. å¾çåé¨çåä½åºç»ä¸ä¸è·æ£ææåå¤§å°ä¸è´ã

13. å¾çéå¸¸å¨æ¯ä¸é¡µçæä¸æ¹æä¸é´ï¼èä¸æ¯æä¸æ¹ã

16. ä¸å¾ä½¿ç¨è¿å¤çé¢è²ç§ç±»ï¼é¢è²æå¥½ä¸è¦é«äºåç§ã

17. å¾çä½¿ç¨ç¢éå¾ã

å³äºå¼ç¨ï¼

21. å¼ç¨æ è®°çéåï¼

å¼ç¨å¨æåå¤ï¼parentï¼ï¼ä½¿ç¨ \citeã
å¼ç¨å¨æååï¼within textï¼
- ACL/NAACL/EMNLPæ¨¡æ¿ä½¿ç¨\citet{...}ï¼
- COLINGæ¨¡æ¿ä½¿ç¨\newcite{...}ï¼
- AAAI/IJCAIæ¨¡æ¿ä½¿ç¨\citeauthor{...} \shortcite{...}ï¼
- IEEEæ¨¡çï¼\citeauthor{...}~(\citeyear{...})
ææï¼(Zhang et al. 2020) vs. Zhang et al. (2020)