-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Add CodeItem as pydantic type, update export methods and APIs #129
Conversation
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
🟢 Require two reviewer for test updatesWonderful, this rule succeeded.When test data is updated, we require two reviewers
|
Signed-off-by: Matteo-Omenetti <[email protected]>
Signed-off-by: Matteo-Omenetti <[email protected]>
Signed-off-by: Matteo-Omenetti <[email protected]>
Signed-off-by: Matteo-Omenetti <[email protected]>
Signed-off-by: Matteo-Omenetti <[email protected]>
91a6650
to
378209e
Compare
Signed-off-by: Michele Dolfi <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
I think we should anyway consider having a stable enum for the programming languages. From the test about I see upcoming problems with Python vs python vs py, CSharp vs C# vs c#, etc |
Signed-off-by: Christoph Auer <[email protected]>
…o code_item
…o code_item
docling_core/types/doc/labels.py
Outdated
ADA = "Ada" | ||
AWK = "Awk" | ||
BASH = "Bash" | ||
C = "C" | ||
C_SHARP = "C#" | ||
C_PLUS_PLUS = "C++" | ||
CMAKE = "CMake" | ||
COBOL = "COBOL" | ||
CSS = "CSS" | ||
CEYLON = "Ceylon" | ||
CLOJURE = "Clojure" | ||
CRYSTAL = "Crystal" | ||
CUDA = "Cuda" | ||
CYTHON = "Cython" | ||
D = "D" | ||
DART = "Dart" | ||
DOCKERFILE = "Dockerfile" | ||
ELIXIR = "Elixir" | ||
ERLANG = "Erlang" | ||
FORTRAN = "FORTRAN" | ||
FORTH = "Forth" | ||
GO = "Go" | ||
HTML = "HTML" | ||
HASKELL = "Haskell" | ||
HAXE = "Haxe" | ||
JAVA = "Java" | ||
JAVASCRIPT = "JavaScript" | ||
JULIA = "Julia" | ||
KOTLIN = "Kotlin" | ||
LISP = "Lisp" | ||
LUA = "Lua" | ||
MATLAB = "Matlab" | ||
MOONSCRIPT = "MoonScript" | ||
NIM = "Nim" | ||
OCAML = "OCaml" | ||
OBJECTIVEC = "ObjectiveC" | ||
OCTAVE = "Octave" | ||
PHP = "PHP" | ||
PASCAL = "Pascal" | ||
PERL = "Perl" | ||
PROLOG = "Prolog" | ||
PYTHON = "Python" | ||
RACKET = "Racket" | ||
RUBY = "Ruby" | ||
RUST = "Rust" | ||
SML = "SML" | ||
SQL = "SQL" | ||
SCALA = "Scala" | ||
SCHEME = "Scheme" | ||
SWIFT = "Swift" | ||
TYPESCRIPT = "TypeScript" | ||
VISUALBASIC = "VisualBasic" | ||
XML = "XML" | ||
YAML = "YAML" | ||
BC = "bc" | ||
DC = "dc" | ||
UNKNOWN = "unknown" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These are partially sorted alphabetically, but not completely — I would best make the sorting consistent.
…f string but variable name
fa63048
to
86cbef2
Compare
No description provided.